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Description 

Background of the Invention 

[0001 ] The invention relates to methods for identifying 5 
nucleic acid sequences that modulate the function of a 
cell, by the use of posMranscriptional gene silencing. 
[0002] Double stranded RNA (dsRNA) has been 
shown to induce sequence-specific gene silencing in a 
number of different organisms. Gene silencing can oc- w 
cur through various mechanisms, one of which is post- 
transcriptional gene silencing (PTGS). In posMranscrip- 
tional gene silencing, transcription of the target locus is 
not affected, but the RNA half-life is decreased. The 
mechanisms by which PTGS occurs are not yet clear. 15 
Exogenous dsRNA has been shown to act as a potent 
inducer of PTGS in nematodes, trypanosomes, and in- 
sects. In addition, studies in C. elegans and Drosophila 
show that a few molecules of dsRNA per cell are suffi- 
cient to trigger a PTGS response. Furthermore, studies 20 
in mice have demonstrated that dsRNA can interfere 
with the expression of genes in mouse embryos. 
[0003] There exists a need to identify molecules that 
selectively regulate the expression of genes in verte- 
brate cells without the associated toxicity of the interfer- 
on response. Such regulation should allow the down- 
regulation of expression from genes whose gene prod- 
ucts are detrimental to the cells. 

Summary of the Invention 

[0004] In general the invention features high through- 
put methods of using PTGS to identify a nucleic acid 
sequence that modulates the function of a cell, gene ex- 
pression of a target nucleic acid, or the biological activity 
of a target polypeptide. The method involves the use of 
specially constructed cDNA libraries derived from a cell, 
for example, a primary cell or a cell line that has an ob- 
servable phenotype or biological activity, (e.g., an activ- 
ity mediated by a target polypeptide or altered gene ex- 
pression), that are transfected into cells to inhibit gene 
expression. This inhibition of gene expression alters the 
function of a cell, gene expression of a target nucleic 
acid, or the biological activity of a target polypeptide, 
and the nucleic acid sequence responsible for the mod- 
ulation can be readily identified. The method may also 
utilize randomized nucleic acid sequences or a given se- 
quence for which the function is not known. Although 
the use of PTGS as a validation strategy is known in the 
art, its use in screening techniques, as described herein, 
is novel. 

[0005] Accordingly, in a first aspect, the invention fea- 
tures a method for identifying a nucleic acid sequence 
that modulates the function of a cell. The method in- 
volves: (a) transforming a population of cells with a dou- 
ble stranded RNA expression library, where the library 
is derived from the cells, where at least two cells of the 
population of cells are each transformed with a different 


nucleic acid from the double stranded RNA expression 
library, and where the nucleic acid is capable of forming 
double stranded RNA; (b)optionally selecting for a cell 
in which the nucleic acid is expressed in the cell; and (c) 
assaying for a modulation in the function of the cell, 
wherein a modulation identifies a nucleic acid sequence 
that modulates the function of a cell. 
[0006] In a desirable embodiment of the first aspect 
of the invention, assaying for a modulation in the func- 
tion of a cell comprises measuring cell motility, apopto- 
sis, cell growth, cell invasion, vascularization, cell cycle 
events, cell differentiation, cell dedifferentiation, neuro- 
nal cell regeneration, or the ability of a cell to support 
viral replication. 

[0007] In a second aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates expression of a target nucleic acid in a cell. The 
method involves: (a) transforming a population of cells 
with a double stranded RNA expression library, where 
the library is derived from the cells, where at least two 
cells of the population of cells are each transformed with 
a different nucleic acid from the double stranded RNA 
expression library, and where the nucleic acid is capable 
of forming double stranded RNA; (b) optionally selecting 
for a cell in which the nucleic acid is expressed in the 
cell; and (c) assaying for a modulation in the expression 
of a gene in the cell, where a modulation identifies a 
nucleic acid sequence that modulates expression of a 
target nucleic acid in a cell. 

[0008] In a desirable embodiment of the second as- 
pect of the invention, the target nucleic acid is assayed 
using DNA array technology. 

[0009] In a third aspect, the invention features a meth- 
od for identifying a nucleic acid sequence that modu- 
lates the biological activity of a target polypeptide in a 
cell. The method involves: (a) transforming a population 
of cells with a double stranded RNA expression library, 
where the library is derived from the cells, where at least 
two cells of the population of cells are each transformed 
with a different nucleic acid from the double stranded 
RNA expression library, and where the nucleic acid is 
capable of forming double stranded RNA; (b)optionally 
selecting for a ceil in which the nucleic acid is expressed 
in the cell; and (c) assaying for a modulation in the bio- 
logical activity of a target polypeptide in the cell, wherein 
a modulation identifies a nucleic acid sequence that 
modulates the biological activity of a target polypeptide. 
[0010] In one embodiment of any of the above as- 
pects of the invention, in transforming step (a), the nu^ 
cleic acid is stably integrated into a chromosome of the 
cell. Integration of the nucleic acid may be random or 
site-specific. Desirably integration is mediated by re- 
combination or retroviral insertion. In addition, desirably 
a single copy of the nucleic acid is integrated into the 
chromosome. In another embodiment of any of the 
above aspects of the invention, in step (a) at least 50, 
more desirably 100; 500; 1000; 10,000; or 50,000 cells 
of the population of cells are each transformed with a 
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different nucleic acid from the double stranded RNA ex- 
pression library. In other embodiments, the population 
of cells is transformed with at least 5%, more desirably 
at least 25%, 50%, 75%, or 90%, and most desirably at 
least 95% of the double stranded RNA expression li- 
brary. In yet another embodiment, the method further 
involves: (d) identifying the nucleic acid sequence by 
amplifying and cloning the sequence. Desirably ampli- 
fication of the sequence involves the use of the polymer- 
ase chain reaction (PCR). 

[0011] In other embodiments of any of the above as- 
pects of the invention, the double stranded RNA expres- 
sion library contains cDNAs or randomized nucleic ac- 
ids. The double stranded RNA expression library may 
be a nuclear double stranded RNA expression library, 
in which case the double stranded nucleic acid is made 
in the nucleus. Alternatively, the double stranded RNA 
expression library may be a cytoplasmic double strand- 
ed RNA expression library, in which case the double 
stranded nucleic acid is made in the cytoplasm. In ad- 
dition, the nucleic acid from the double stranded RNA 
expression library may be made in vitro or in vivo. In 
addition, the identified nucleic acid sequence may be 
located in the cytoplasm of the cell. 
In still another embodiment of any of the above aspects 
of the invention, the nucleic acid is contained in a vector, 
for example a double stranded RNA expression vector. 
The vector may then be transformed such that it is stably 
integrated into a chromosomeof the cell, or it may func- 
tion as an episomal (non-integrated) expression vector 
within the cell. In one embodiment, a vector that is inte- 
grated into a chromosome of the cell contains a promot- 
er operably linked to a nucleic acid encoding ahairpin or 
double stranded RNA. In another embodiment, the vec- 
tor does not contain a promoter operably linked to a nu- 
cleic acid encoding a double stranded RNA. In thislater 
embodiment, the vector integrates into a chromosome 
of a cell such that an endogenous promoter is operably 
linked to a nucleic acid from the vector that encodes a 
double stranded RNA. Desirably, the double stranded 
RNA expression vector comprises at least one RNA 
polymerase II promoter, for example, a human CMV-im- 
mediate early promoter (HCMV-IE) or a simian CMV 
(SCMV) promoter, at least one RNA polymerase I pro- 
moter, or at least one RNA polymerase III promoter. The 
promoter may also be a T7 promoter, in which case, the 
cell further comprises T7 polymerase. Alternatively, the 
promoter may be an SP6 promoter, in which case, the 
cell further comprises SP6 polymerase. The promoter 
may also be one convergent T7 promoter and one con- 
vergent SP6 promoter. A cell may be made to contain 
T7 or SP6 polymerase by transforming the cell with a 
T7 polymerase or an SP6 polymerase expression plas- 
mid, respectively. In some embodiments, a T7 promoter 
or a RNA polymerase III promoter is operably linked to 
a nucleic acid that encodes a small double stranded 
RNA (e.g., a double stranded RNA that is less than 200, 
150, 100, 75, 50, or 25 nucleotides in length). In other 


embodiments, the promoter is a mitochondrial promoter 
that allows cytoplasmic transcription of the nucleic acid 
in the vector (see, for example, the mitochondrial pro- 
moters described in WO 00/63364, filed April 19, 2000). 
5 Alternatively, the promoter is an inducible promoter, 
such as a iac (Cronin et al. Genes & Development 15: 
1506-1517, 2001), ara (Khlebnikov et a/., J Bacterid. 
2000 Dec;182(24):7029-34), ecdysone (Rheogene, 
www.rheogene.com), RU48 (mefepristone) (corticos- 
10 teroid antagonist) (Wang XJ, Liefer KM, Tsai S, O'Malley 
BW, Roop DR, Proc Natl Acad Sci USA. 1999 Jul 20; 
96(15):8483-8), or tet promoter (Rendal et al, Hum 
Gene Ther. 2002 Jan; 13(2): 335-42. and Larnartina et 
al, Hum Gene Ther. 2002 Jan; 13(2): 199-210) or a pro- 
fs moter disclosed in WO 00/63364, filed April 19, 2000. 
In desirable embodiments, the inducible promoter is not 
induced until all the episomal vectors are eliminated 
from the cell. The vector may also comprise a selectable 
marker. In addition, these vectors may be used in com- 
20 bination with methods that inhibit or prevent an interfer- 
on response or double stranded RNA stress response, 
as described herein. 

[001 2] Desirably in a vector for use in any of the above 
aspects of the invention, the sense strand and the anti- 

25 sense strand of the nucleic acid sequence are tran- 
scribed from the same nucleic acid sequence using two 
convergent promoters. In another desirable embodi- 
ment, in a vector for use in any of the above aspects of 

the:inventionrthe nucleic acid sequence comprises an 

30 inverted repeat, such that upon transcription, the nucleic 
acid forms a double stranded RNA. 
[0013] In still other embodiments of any of the above 
aspects of the invention, the cell and the vector each 
further comprise a ioxP site and site-specific integration 

35 of the nucleic acid into a chromosome of the cell occurs 
through recombination between the ioxP sites. In addi- 
tion, step (b) of any of the above aspects of the invention 
further involves rescuing the nucleic acid through Cre- 
mediated double recombination. 

40 [001 4] In still further embodiments of any of the above 
aspects of the invention, the identified nucleic acid se- 
quence is located in the nucleus of the cell. Alternatively, 
the identified nucleic acid sequence may be located in 
the cytoplasm of the cell. 

45 [001 5] In yet another embodiment of any of the above 
aspects of the invention, the nucleic acid from the dou- 
ble stranded RNA expression library is at least 1 00, 500, 
600, or 1000 nucleotides in length. In other embodi- 
ments of any of the above aspects of the invention, the 

so nucleic acid from the double stranded RNA expression 
library is at least 10, 20, 30, 40, 50, 60, 70, 80, or 90 
nucleotides in length. In yet other embodiments, the 
number of nucleotides in the nucleic acid from the dou- 
ble stranded RNA expression library is between 5-100 

55 nucleotides, 15-100 nucleotides, 20-95 nucleotides, 
25-90 nucleotides, 35-85 nucleotides, 45-80 nucle- 
otides, 50-75 nucleotides, or 55-70 nucleotides, inclu- 
sive. In still other embodiments, the number of nucle- 
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otides in the nucleic acid from the double stranded RNA 
expression library is contained in one of the following 
ranges: 5-15 nucleotides, 15-20 nucleotides, 20-25 nu- 
cleotides, 25-35 nucleotides, 35-45 nucleotides, 45-60 
nucleotides, 60-70 nucleotides, 70-80 nucleotides, 
80-90 nucleotides, or 90-100 nucleotides, inclusive. In 
other embodiments, the nucleic acid contains less than 
50,000; 10,000; 5,000; or 2,000 nucleotides. In addition, 
the nucleic acid from the double stranded RNA expres- 
sion library may contain a sequence that is less than a 
full length RNA sequence. . 

[001 6] In still further embodiments of any of the above 
-aspects of the invention, the cell is a plant cell or an an- 
imal cell. Desirably the animal cell is a vertebrate or 
mammalian cell, for example, a human cell. The cell 
may be ex vivo or in vivo. The cell may be a gamete or 
a somatic cell, for example, a cancer cell, a stem cell, a 
cell of the immune system, a neuronal cell, a muscle 
cell, or an adipocyte. 

[0017] Transformation/transfection of the cell may oc- 
cur through a variety of means including, but not limited 
to, lipofection, DEAE-dextran-mediated transfection, 
microinjection, protoplast fusion, calcium phosphate 
precipitation, viral or retroviral delivery, electroporation, 
or biolistic transformation. The RNA or RNA expression 
vector (DNA) may be naked RNA or DNA or local anes- 
thetic complexed RNA or DNA (Pachuk et a/., Biochim. 
Biophys. Acta 1468:20-30, 2000). In another embodi- 
ment, the cell is not a C. elegans cell. Desirably the ver- 
tebrate or mammalian cell has been cultured for only a 
small number of passages (e.g., less than 30 passages 
of a cell line that has been directly obtained from Amer- 
ican Type Culture Collection), or are primary cells. De- 
sirably, the vertebrate or mammalian cell is transformed 
with nucleic acids that are not complexed with cationic 
lipids. 

[001 8] In yet another embodiment of any of the above 
aspects of the invention, the cell is derived from a parent 
cell, and is generated by: (a) transforming a population 
of parent cells with a bicistronic plasmid expressing a 
selectable marker and a reporter gene, and comprising 
a loxP site; (b) selecting for a cell in which the plasmid 
is stably integrated; and (c) selecting for a cell in which 
one copy of the plasmid is stably integrated in a tran- 
scriptionally active locus. Desirably the selectable mark- 
er is G418 and the reporter gene is green fluorescent 
protein (GFP). 

[0019] In still another embodiment of the above as- 
pects of the invention, generation of the double stranded 
expression library comprises: (a) isolating RNA from a 
cell; (b) synthesizing cDNAs from the RNA of step (a); 
and (c) cloning each cDNA into a vector. Desirably cD- 
NA synthesis is optimized and/or size selected for the 
generation and/or selection of cDNAs that are at least 
100, 500, 600, or 1000 nucleotides in length. In other 
embodiments, the cDNAs are least 10, 20, 30, 40, 50, 
60, 70, 80, or 90 nucleotides in length. In yet other em- 
bodiments, the number of nucleotides in the cDNAs is 


between 5-100 nucleotides, 15-100 nucleotides, 20-95 
nucleotides, 25-90 nucleotides, 35-85 nucleotides, 
45-80 nucleotides, 50-75 nucleotides, or 55-70 nucle- 
otides, inclusive. In still other embodiments, the number 
5 of nucleotides in the cDNAs is contained in one of the 
following ranges: 5-15 nucleotides, 15-20 nucleotides, 
20-25 nucleotides, 25-35 nucleotides, 35-45 nucle- 
otides, 45-60 nucleotides, 60-70 nucleotides, 70-80 nu- 
cleotides, 80-90 nucleotides, or 90-100 nucleotides, in- 
clusive. In other embodiments, the cDNAs contain less 
than 50,000; 1 0,000; 5,000; or 2,000 nucleotides. In ad- 
dition, the cDNA may encode an RNA fragment that is 
less than full length. Desirably the vector comprises two 
convergent T7 promoters, two convergent SP6 promot- 
ers, or one convergent T7 promoter and one convergent 
SP6 promoter, a selectable marker, and/or a loxP site. 
[0020] In an additional embodiment of any of the 
above aspects of the invention, the method is carried 
out under conditions that inhibit or prevent an interferon 
response or double stranded RNA stress response. 
[0021] In a fourth aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates the function of a cell, involving: (a) transforming 
a population of cells with a double stranded RNA that is 
derived from the cells; (b)optionally selecting for a cell 
in which the nucleic acid is expressed; and (c) assaying 
for a modulation in the function of the cell, wherein the 
modulation identifies a nucleic acid sequence that mod- 
ulates the function of a cell, wherein the method is de- 
30 sirably carried out under conditions that inhibit or pre- 
vent an interferon response or double stranded RNA 
stress response. 

[0022] In a desirable embodiment of the fourth aspect 
of the invention, assaying for a modulation in the func- 

35 tion of a cell comprises measuring cell motility, apopto- 
sis, cell growth, celt invasion, vascularization, cell cycle 
events, cell differentiation, cell dedifferentiation, neuro- 
nal cell regeneration, or the ability of a cell to support 
viral replication. 

40 [0023] In a fifth aspect, the invention features a meth- 
od for identifying a nucleic acid sequence that modu- 
lates expression of a target nucleic acid in a cell, involv- 
ing: (a) transforming a population of cells with a double 
stranded RNA that is derived from the cells; (b)optionally 

45 selecting for a cell in which the nucleic acid is expressed; 
and (c) assaying for a modulation in the expression of 
the gene in the cell, wherein the modulation identifies a 
nucleic acid sequence that modulates expression of a 
target nucleic acid in a cell, wherein the method is de- 

50 sirably carried out under conditions that inhibit or pre- 
vent an interferon response or double stranded RNA 
stress response. 

[0024] In a desirable embodiment of the fifth aspect 
of the invention, the target nucleic acid is assayed using 
55 DNA array technology. 

[0025] In a sixth aspect, the invention features a meth- 
od for identifying a nucleic acid sequence that modu- 
lates the biological activity of a target polypeptide in a 
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cell, Involving: (a) transforming a population of cells with 
a double stranded RNA that is derived from the cells; 
(b)optionally selecting for a cell in which the nucleic acid 
is expressed in the cell; and (c) assaying for a modula- 
tion in the biological activity of a target polypeptide in 
the cell, wherein the modulation identifies a nucleic acid 
sequence that modulates the biological activity of a tar- 
get polypeptide in a cell, wherein the method is desirably 
carried out under conditions that inhibit or prevent an 
interferon response or double stranded RNA stress re- 
sponse. 

[0026] In a seventh aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates the function of a cell, involving: (a) transforming 
a population of cells with a double stranded RNA; (b) 
optionally selecting for a cell in which the nucleic acid is 
expressed; and (c) assaying for a modulation in the 
function of the cell. Desirably, the modulation identifies 
a nucleic acid sequence that modulates the function of 
a cell, wherein the method is desirably carried out under 
conditions that or prevent an interferon response or dou- 
ble stranded RNA stress response. 
[0027] In a desirable embodiment of the seventh as- 
pect of the invention, assaying for a modulation in the 
function of a cell comprises measuring cell motility, ap- 
optosis, cell growth, cell invasion, vascularization, cell 
cycle events, cell differentiation, cell dedifferentiation, 
neuronal cell regeneration, or the ability of a cell to sup- 
port viral replication. 

[0028] In a eighth aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates expression of a target nucleic acid in a cell, in- 
volving: (a) transforming a population of cells with a dou- 
ble stranded RNA; (b)optionally selecting for a cell in 
which the nucleic acid is expressed; and (c) assaying 
for a modulation in the expression of the gene in the cell, 
wherein the modulation identifies a nucleic acid se- 
quence that modulates expression of a target nucleic 
acid in a cell. Desirably, the method is carried out under 
conditions that inhibit or prevent an interferon response 
or double stranded RNA stress response. 
[0029] In a desirable embodiment of the eighth aspect 
of the invention, the target nucleic acid is assayed using 
DNA array technology. 

[0030] In a ninth aspect, the invention features a 
method for identifying a nucleic acid sequence that mod- 
ulates the biological activity of a target polypeptide in a 
cell, involving: (a) transforming a population of cells with 
a double stranded RNA; (b)optionaily selecting for a cell 
in which the nucleic acid is expressed in the cell; and (c) 
assaying for a modulation in the biological activity of a 
target polypeptide in the cell, wherein the modulation 
identifies a nucleic acid sequence that modulates the 
biological activity of a target polypeptide in a cell. Desir- 
ably, the method is carried out under conditions that in- 
hibit or prevent an interferon response double stranded 
RNA stress response. 

[0031] In one embodiment of any of the above as- 


pects of the invention, in step (a) at least 2, more desir- 
ably 50; 100; 500; 1000; 10,000; or 50,000 cells of the 
population of cells are each transformed with a different 
double stranded RNA from a double stranded RNA ex- 

5 pression library. Desirably, at most one double stranded 
RNA is inserted into each cell. In other embodiments, 
the population of cells is transformed with at least 5%, 
more desirably at least 25%, 50%, 75%, or 90%, and 
most desirably, at least 95% of the double stranded RNA 

10 expression library. In still another embodiment of any of 
the fourth, fifth, or sixth aspects of the invention, the 
method further involves: (dj identifying the nucleic acid 
sequence by amplifying and cloning the sequence. De- 
sirably amplification of the sequence involves the use of 

15 the polymerase chain reaction (PCR). 

[0032] In a tenth aspect, the invention features a cell 
or a population of cells that expresses a double stranded 
RNA that (i) modulates a function of the cell, (ii) modu- 
lates the expression of a target nucleic acid (e.g., an en- 

20 dogenous or pathogen gene) in the cell, and/or (iii) mod- 
ulates the biological activity of a target protein (e.g., an 
endogenous or pathogen protein) in the cell. Desirably, 
the cell contains only one molecular species of double 
stranded RNA or only one copy of a double stranded 

25 RNA expression vector (e.g., a stably integrated vector). 
Desirably, the cell or population of cells is produced us- 
ing one or more methods of the invention. In other em- 
bodiments, the double stranded RNA is expressed un- 
der conditions that inhibit or prevent an interferon re- 

30 sponse or a double stranded RNA stress response 
[0033] In other embodiments of any of the fourth, fifth, 
sixth, seventh, eighth, ninth, or tenth aspects of the in- 
vention, the double stranded RNA is derived from cD- 
NAs or randomized nucleic acids. In addition, the double 

35 stranded RNA may be a cytoplasmic double stranded 
RNA, in which case the double stranded nucleic acid is 
made in the cytoplasm. The double stranded RNA may 
be made in vitro or in vivo. In addition, the identified nu- 
cleic acid sequence may be located in the cytoplasm of 

40 the cell. 

[0034] In still another embodiment of any of the fourth, 
fifth, sixth, seventh, eighth, ninth, or tenth aspects of the 
invention, the nucleic acid is contained in a vector, for 
example, a double stranded RNA expression vector that 

45 is capable of forming a double stranded RNA. Desirably 
the double stranded RNA expression vector comprises 
at least one promoter. The promoter may be a T7 pro- 
moter, in which case, the cell further comprises T7 
polymerase. Alternatively, the promoter may be an SP6 

so promoter, in which case, the cell further comprises SP6 
polymerase. The promoter may also be one convergent 
T7 promoter and one convergent SP6 promoter. A cell 
may be made to contain T7 or SP6 polymerase by trans- 
forming the cell with a T7 polymerase or an SP6 

55 polymerase expression plasmid, respectively. The vec- 
tor may also comprise a selectable marker, for example 
hygromycin. 

[0035] Desirably in a vector for use in any of the fourth, 
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fifth, sixth, seventh, eighth, ninth, or tenth aspects of the 
invention, the sense strand and the antisense strand of 
the nucleic acid sequence are transcribed from the 
same nucleic acid sequence using two convergent pro- 
moters. In another desirable embodiment, in a vector for 
use in any of the above aspects of the invention, the 
nucleic acid sequence comprises an inverted repeat, 
such that upon transcription, the nucleic acid forms a 
double stranded RNA. 

[0036] In yet another embodiment of any of the fourth, 
fifth, sixth, seventh, eighth, ninth, or tenth aspects of the 
invention, the double stranded RNA is at least 1 00, 500, 
600, or 1000 nucleotides~in _ length. In other embodi- 
ments of any of the fourth, fifth, or sixth aspects of the 
invention, the double stranded RNA is at least 10, 20, 
30, 40, 50, 60, 70, 80, or 90 nucleotides in length. In yet 
other embodiments, the number of nucleotides in the 
double stranded RNA is between 5-100 nucleotides, 
15-100 nucleotides, 20-95 nucleotides, 25-90 nucle- 
otides, 35-85 nucleotides, 45-80 nucleotides, 50-75 nu- 
cleotides, or 55-70 nucleotides, inclusive. In still other 
embodiments, the number of nucleotides in the double 
stranded RNA is contained in one of the following rang- 
es: 5-15 nucleotides, 15-20 nucleotides, 20-25 nucle- 
otides, 25-35 nucleotides, 35-45 nucleotides, 45-60 nu- 
cleotides, 60-70 nucleotides, 70-80 nucleotides, 80-90 
nucleotides, or 90-100 nucleotides, inclusive. In other 
embodiments, the double stranded RNA contains less 
than 50,000; 10,000; 5,000; or 2,000 nucleotides. In ad- 
dition, the double stranded RNA may contain a se- 
quence that is less than a full length RNA sequence. 
[0037] In still further embodiments of any of the fourth, 
fifth, sixth, seventh, eighth, ninth, or tenth aspects of the 
invention, the cell is a plant cell or an animal cell. Desir- 
ably the animal cell is a vertebrate or mammalian cell, 
for example, a human cell. The cell may be ex vivo or in 
vivo. The cell may be a gamete or a somatic cell, for 
example, a cancer cell, a stem cell, a cell of the immune 
system, a neuronal cell, a muscle cell, or an adipocyte. 
[0038] In other embodiments of any of the first, sec- 
ond, third, seventh, eighth, ninth, or tenth aspects of the 
invention, the double stranded RNA is derived from a 
cell or a population of cells and used to transform an- 
other cell population of either the same cell type or a 
different cell type. In desirable embodiments, the trans- 
formed cell population contains cells of a cell type that 
is related to the cell type of the cells from which the dou- 
ble stranded RNA was derived (e.g., the transformation 
of cells of one neuronal cell type with the double strand- 
ed RNA derived from cells of another neuronal cell type). 
In yet other embodiments of any of these aspects, the 
double stranded RNA contains one or more contiguous 
or non-contiguous positions that are randomized (e.g., 
by chemical or enzymatic synthesis using a mixture of 
nucleotides that may be added at the randomized posi- 
tion). In still other embodiments, the double stranded 
RNA is a randomized nucleic acid in which segments of 
ribonucleotides and/or deoxyribonucleotides are ligated 


to form the double stranded RNA. 
[0039] In other embodiments of any of various as- 
pects of the invention, the double stranded RNA specif- 
ically hybridizes to a target nucleic acid but does not 

5 substantially hybridize to non-target molecules, which 
include other nucleic acids in the cell or biological sam- 
ple having a sequence that is less than 99, 95, 90, 80, 
or 70% identical or complementary to that of the target 
nucleic acid. Desirably, the amount of the these non-tar- 

10 get molecules hybridized to, or associated with, the dou- 
ble stranded RNA, as measured using standard assays, 
is 2-fold, desirably 5-fold, more desirably 10-fold, and 
most desirably 50-fold lower than the amount of the tar- 
get nucleic acid hybridized to, or associated with, the 

is double stranded RNA. In other embodiments, the 
amount of a target nucleic acid hybridized to, or associ- 
ated with, the double stranded RNA, as measured using 
standard assays, is 2-fold, desirably 5-fold, more desir- 
ably 1 0-fold, and most desirably 50-fold greater than the 

20 amount of a control nucleic acid hybridized to, or asso- 
ciated with, the double stranded RNA. Desirably, the 
double stranded RNA only hybridizes to one target nu- 
cleic acid from a cell under denaturing, high stringency 
hybridization conditions. In certain embodiments, the 

25 double stranded RNA is substantially homologous (e.g., 
at least 80, 90, 95, 98,or 100% homologous) to only one 
target nucleic acid from a cell. In other embodiments, 
the double stranded RNA is homologous to multiple 
RNAs, such as RNAs from the same gene family. In yet 

30 other embodiments, the double stranded RNA is homol- 
ogous to distinctly different mRNA sequences from 
genes that are similarly regulated (e.g., developmental, 
chromatin remodeling, or stress response induced). In 
other embodiments, the double stranded RNA is homol- 

35 ogous to a large number of RNA molecules, such as a 
double stranded RNA designed to induce a stress re- 
sponse or apoptosis. In other embodiments, the percent 
decrease in the expression of a target nucleic acid is at 
least 2, 5, 10, 20, or 50 fold greater than the percent 

40 decrease in the expression of a non-target or control nu- 
cleic acid. Desirably, the double stranded RNA inhibits 
the expression of a target nucleic acid but has negligible, 
if any, effect on the expression of other nucleic acids in 
the cell. Examples of control nucleic acids include nu- 

45 cleic acids with a random sequence or nucleic acids 
known to have little, if any, affinity for the double strand- 
ed RNA. 

[0040] In other embodiments of any of various as- 
pects of the invention, at most one molecular species of 

50 double stranded RNA is inserted into each cell. In other 
embodiments, at most one vector is stably integrated 
into the genome of each cell. In various embodiments, 
the double stranded RNA is active in the nucleus of the 
transformed cell and/or is active in the cytoplasm of the 

55 transformed cell. In various embodiments, at least 1,10, 
20, 50, 100, 500, or 1000 cells or ail of the cells in the 
population are selected as cells that contain or express 
a double stranded RNA. In some embodiments, at least 
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1,10, 20, 50, 100, 500, or 1000 cells or all of the cells 
in the population are assayed for a modulation in the 
function of the cell, a modulation in the expression of a 
target nucleic acid (e.g., an endogenous or pathogen 
gene) in the cell, and/or a modulation in the biological 
activity of a target protein (e.g., an endogenous or path- 
ogen protein) in the cell. 

[0041] In other embodiments, the double stranded 
RNA or double stranded RNA expression vector is com- 
plexed with one or more cationic lipids or cationic am- 
phiphiies, such as the compositions disclosed in US 
4,897,355 (Eppstein etal., filed October 29, 1987), US 
5,264,618 (Feigner et a/., filed April 16, 1991) or US 
5,459,127 (Feigner et at., filed September 16, 1993). In 
other embodiments, the double stranded RNA or double 
stranded RNA expression vector is complexed with a 
liposomes/liposomic composition that includes a cation- 
ic lipid and optionally includes another component such 
as a neutral lipid (see, for example, US 5,279,833 
(Rose), US 5,283,185 (Epand), and US 5,932,241). In 
yet other embodiments, the double stranded RNA or 
double stranded RNA expression vector is complexed 
with any other composition that is devised by one of or- 
dinary skill in the fields of pharmaceutics and molecular 
biology. 

[0042] Desirably, the double stranded RNA specifical- 
ly hybridizes to a target nucleic acid but does not sub- 
stantially hybridize to non-target molecules, which in- 
clude other nucleic acids in the cell or biological sample 
having a sequence that is less than 99, 95, 90, 80, or 
70 % identical to or complementary to that of the target 
nucleic acid. In other embodiments, the percent de- 
crease in the expression of a target nucleic acid is at 
least 2, 5, 10, 20, or 50 fold greater than the percent 
decrease in the expression of a non-target or control nu- 
cleic acid. Desirably, the double stranded RNA inhibits 
the expression of the target nucleic acid but has negli- 
gible, if any, effect on the expression of other nucleic 
acids in the cell. 

[0043] Transf ormation/transf ection of the cell may oc- 
cur through a variety of means including, but not limited 
to, lipofection, DEAE-dextran-mediated transfection, 
microinjection, protoplast fusion, calcium phosphate 
precipitation, viral or retroviral delivery, electroporation, 
or biolistic transformation. The RNA or RNA expression 
vector (DNA) may be naked RNA or DNA or local anes- 
thetic complexed RNA or DNA (Pachuk et ai, supra). In 
yet another embodiment, the cell is not a C. efegansceW. 
Desirably the vertebrate or mammalian cell has been 
cultured for only a small number of passages (e.g., less 
than 30 passages of a cell line that has been directly 
obtained from American Type Culture Collection), or are 
primary cells. In addition, desirably the vertebrate or 
mammalian cell is transformed with double stranded 
RNA that is not complexed with cationic lipids. 
[0044] The transcription systems described herein 
provide advantages to other double stranded expres- 
sion systems. Following transformation of the double 


stranded RNA library, cells contain hundreds to thou- 
sands of double stranded RNA expression cassettes, 
with concomitant expression of that many expression 
cassettes. In the double stranded RNA expression sys- 
tem of the present invention, double stranded RNA 
(dsRNA) expression cassettes contained within the ex- 
pression vector integrate into the chromosome of the 
transfected cell. Desirably, every transformed cell inte- 
grates one of the double stranded expression cassettes, 
i Through expansion of the transformed cell, episomal 
(non-integrated) expression vectors are diluted out of 
the cell over time. Desirably no transcription occurs until 
the episomal expression vectors are diluted out of the 
cell, such that not more than 5 episomal vectors remain 
5 in the cell. Most desirably, no transcription occurs until 
ail of the episomal vectors have been diluted out of the 
cell, and only the integrated expression cassette re- 
mains. The time it takes for all episomal vectors to be 
removed from the cell is proportional to the replication 
o rate of the transformed cell, and is generally on the order 
of two to several weeks of cell culture and growth. The 
numbers of copies of a dsRNA molecule in a trans- 
formed cell can be determined using, for example, 
standard PCR techniques, and thereby, the number of 
*s episomal vectors in a given cell can be monitored. 
[0045] Once a stable integrant containing five or few- 
er, and desirably no episomal expression vectors, tran- 
scription is induced, allowing dsRNA to be expressed in 
the cells. This method ensures that, if desired, only one 
?o species or not more than about five species of dsRNA 
is expressed per cell, as opposed to other methods that 
express hundreds to thousands of double stranded spe- 
cies. 

[0046] Another problem that can occur in other double 
35 stranded expression systems or dsRNA delivery sys- 
tems is that some dsRNA sequences, possibly in certain 
cell types and through certain delivery methods, may re- 
sult in an interferon response (Jaramillo et a/., Cancer 
Invest. 13:327-338, 1995). During the induction of post- 
40 transcriptional gene silencing events, induction of an in- 
terferon response is not desired, as this could lead to 
ceil death and possibly to the prevention of gene silenc- 
ing. An additional advantage of the present invention is 
that the dsRNA delivery methods described herein are 
45 performed such that an interferon response is inhibited 
or prevented. 

[0047] One of the components of an interferon re- 
sponse is the induction of the interferon-induced protein 
kinase PKR (Jaramillo et a/., supra). Suppression of the 

so interferon response and/or the PKR response, using 
techniques described herein, is desired in the cells tar- 
geted for a PTGS event in those instances where an 
interferon response would otherwise be induced. Meth- 
ods for suppressing an interferon response or dsRNA 

55 stress response can be used in combination with any of 
the methods for identifying a nucleic acid sequence that 
modulates the function of a cell, gene expression in a 
cell, or the biological activity of a target polypeptide. 


13 


EP1 229134 A2 


14 


[0048] The methods of the present invention provide 
a means for high throughput identification of nucleic acid 
sequences involved in modulating the function of a cell, 
the expression of a target nucleic acid in a cell, or the 
biological activity of a target polypeptide in a cell. By 
transforming a population of cells with a double stranded 
RNA expression library, the effects of many PTGS 
events on cell function, expression of a target nucleic 
acid in a cell, or the biological activity of a target polypep- 
tide in a cell can be evaluated simultaneously, thereby 
allowing for rapid identification of the nucleic acid se- 
quence involved in a cell function, target nucleic acid 
expression,^ biological activity of a target polypeptide 
of interest. 

[0049] By "nucleic acid," "nucleic acid sequence," 
"double stranded RNA nucleic acid sequence," or "dou- 
ble stranded RNA nucleic acid" is meant a nucleic acid 
or a portion thereof that is free of the genes that, in the 
naturally-occurring genome of the organism from which 
the nucleic acid sequence of the invention is derived, 
flank the gene. The term therefore includes, for exam- 
ple, a recombinant DNA, with or without 5' or 3' flanking 
sequences that is incorporated into a vector, for exam- 
ple, a double stranded RNA expression vector; into an 
autonomously replicating plasmid or virus; or into the 
genomic DNA of a prokaryote or eukaryote; or which ex- 
ists as a separate molecule {e.g., a cDNA or a genomic 
or cDNA fragment produced by PCR or restriction en-, 
donuclease digestion) independent of other sequences. 
[0050] By "double stranded RNA" is meant a nucleic 
acid containing a region of two or more nucleotides that 
are in a double stranded conformation. In various em- 
bodiments, the double stranded RNA consists entirely 
of ribonucleotides or consists of a mixture of ribonucle- 
otides and deoxynucleotides, such as the RNA/DNA hy- 
brids disclosed, for example, by WO 00/63364, filed 
April 19, 2000 or U.S.S.N. 60/130,377, filed April 21, 
1999. The double stranded RNA may be a single mole- 
cule with a region of self-complimentarity such that nu- 
cleotides in one segment of the molecule base pair with 
nucleotides in another segment of the molecule. In var- 
ious embodiments, a double stranded RNA that consists 
of a single molecule consists entirely of ribonucleotides 
or includes a region of ribonucleotides that is compli- 
mentary to a region of deoxyribonucleotides. Alterna- 
tively, the double stranded RNA may include two differ- 
ent strands that have a region of complimentarity to 
each other. In various embodiments, both strands con- 
sist entirely of ribonucleotides, one strand consists en- 
tirely of ribonucleotides and one strand consists entirely 
of deoxyribonucleotides, or one or both strands contain 
a mixture of ribonucleotides and deoxyribonucleotides. 
Desirably, the regions of complimentarity are at least 70, 
80, 90, 95, 98, or 100% complimentary. Desirably, the 
region of the double stranded RNA that is present in a 
double stranded conformation includes at least 5, 10, 
20, 30, 50, 75,1 00, 200, 500, 1 000, 2000 or 5000 nucle- 
otides or includes all of the nucleotides in a cDNA being 


represented in the double stranded RNA. In some em- 
bodiments, the double stranded RNA does not contain 
any single stranded regions, such as single stranded 
ends, or the double stranded RNA is a hairpin. Desirable 
. 5 RNA/DNA hybrids include a DNA strand or region that 
is an antisense strand or region (e.g, has at least 70, 80, 
90, 95, 98, or 100% complimentary to a target nucleic 
acid) and an RNA strand or region that is an sense 
strand or region (e.g, has at least 70, 80, 90, 95, 98, or 

10 100% identity to a target nucleic acid). In various em- 
bodiments, the RNA/DNA hybrid is made in vitro using 
enzymatic or chemical synthetic methods such as those 
described herein or those described in WO 00/63364, 
filed April 19, 2000 or U.S.S.N. 60/130,377, filed April 

15 21 , 1999. In other embodiments, a DNA strand synthe- 
sized in vitro is complexed with an RNA strand made in 
vivo or in vitro before, after, or concurrent with the trans- 
formation of the DNA strand into the cell. In yet other 
embodiments/the double stranded RNA is a single cir- 

20 cular nucleic acid containing a sense and an antisense 
region, or the double stranded RNA includes a circular 
nucleic acid and either a second circular nucleic acid or 
a linear nucleic acid (see, for example, WO 00/63364, 
filed April 19, 2000 or U.S.S.N. 60/130,377, filed April 

25 21 , 1 999.) Exemplary circular nucleic acids include lariat 
structures in which the free 5* phosphoryl group of a nu- 
cleotide becomes linked to the 2' hydroxyl group of an- 
other nucleotide in a loop back fashion. 
[0051] In other embodiments, the double stranded 

30 RNA includes one or more modified nucleotides in which 
the 2' position in the sugar contains a halogen (such as 
flourine group) or contains an alkoxy group (such as a 
methoxy group) which increases the half-life of the dou- 
ble stranded RNA in vitro or in vivo compared to the cor- 

35 responding double stranded RNA in which the corre- 
sponding 2' position contains a hydrogen or an hydroxyl 
group. In yet other embodiments, the double stranded 
RNA includes one or more linkages between adjacent 
nucleotides other than a naturally-occurring phosphodi- 

40 ester linkage. Examples of such linkages include phos- 
phoramide, phosphorothioate, and phosphorodithioate 
linkages. In other embodiments, the double stranded 
RNA contains one or two capped strands, as disclosed, 
for example, by WO 00/63364, filed April 1 9, 2000 or U. 

45 S.S.N. 60/130,377, filed April 21 , 1999. In other embod- 
iments, the double stranded RNA contains coding se- 
quence or non-coding sequence, for example, a regu- 
latory sequence (e.g., a transcription factor binding site, 
a promoter, or a 5* or 3' untranslated region (UTR) of an 

so mRNA). Additionally, the double stranded RNA can be 
any of the at least partially double-stranded RNA mole- 
cules disclosed in WO 00/63364, filed April 19, 2000 
(see, for example, pages 8-22). Any of the double 
stranded RN As may be expressed in vitro or in vivo us- 
55 ing the methods described herein or standard methods, 
such as those described in WO O0/63364, filed April 19, 
2000 (see, for example, pages 16-22). 
[0052] By -double stranded RNA expression library" 
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or "dsRNA expression library" is meant a collection of 
nucleic acid expression vectors containing nucleic acid 
sequences, for example, cDNA sequences or rand- 
omized nucleic acid sequences that are capable of form- 
ing a double stranded RNA (dsRNA) upon expression 
of the nucleic acid sequence. Desirably the double 
stranded RNA expression library contains at least 
10,000 unique nucleic acid sequences, more desirably 
at least 50,000; 1 00,000; or 500,000 unique nucleic acid 
sequences, and most desirably, at least 1,000,000 
unique nucleic acid sequences. By a "unique nucleic ac- 
id sequence" is meant that a nucleic acid sequence of 
a double stranded RNA expression library has desirably 
less than 50%, more desirably less than 25% or 20%, 
and most desirably less than 10% nucleic acid identity 
to another nucleic acid sequence of a double stranded 
RNA expression library when the full length sequence 
are compared. Sequence identity is typically measured 
using sequence analysis software with the default pa- 
rameters specified therein (e.g., Sequence Analysis 
Software Package of the Genetics Computer Group, 
University of Wisconsin Biotechnology Center, 1710 
University Avenue, Madison, Wl 53705). This software 
program matches similar sequences by assigning de- 
grees of homology to various substitutions, deletions, 
and other modifications. 

[0053] The preparation of cDNAs for the generation 
of double stranded RNA expression libraries is de- 
" scribed herein. A randomized nucleic acid library may 
also be generated as described in detail below. The dou- 
ble stranded RNA expression library may contain nucle- 
ic acid sequences that are transcribed in the nucleus or 
that are transcribed in the cytoplasm of the cell. A double 
stranded RNA expression library may be generated us- 
ing techniques described herein. 
[0054] By "target nucleic acid" is meant a nucleic acid 
sequence whose expression is modulated as a result of 
post-transcriptional gene silencing. As used herein, the 
target nucleic acid may be in the cell in which the PTGS 
event occurs or it may be in a neighboring cell, or in a 
cell contacted with media or other extracellular fluid in 
which the celt that has undergone the PTGS event is 
contained. Exemplary target nucleic acids include nu- 
cleic acids associated with cancer or abnormal cell 
growth, such as oncogenes, and nucleic acids associ- 
ated with an autosomal dominant or recessive disorder. 
Desirably, the double stranded RNA inhibits the expres- 
sion of an allele of a nucleic acid that has a mutation 
associated with a dominant disorder and does not sub- 
stantially inhibit the other allele of the nucleic acid (e.g, 
an allele without a mutation associated with the disor- 
der). Other exemplary target nucleic acids include host 
cellular nucleic acids or pathogen nucleic acids required 
for the infection or propagation of a pathogen, such as 
a virus, bacteria, yeast, protozoa, or parasite. 
[0055] By "target polypeptide" is meant a polypeptide 
whose biological activity is modulated as a result of post- 
transcriptional gene silencing. As used herein, the target 


polypeptide may be in the cell in which the PTGS event 
occurs or it may be in a neighboring cell, or in a cell con- 
tacted with media or other extracellular fluid in which the 
cell that has undergone the PTGS event is contained. 

5 [0056] As used herein, by "randomized nucleic acids" 
is meant nucleic acids, for example, those that are at 
least 1 00, 500, 600, or 1 000 nucleotides in length, con- 
structed from RNA isolated from a particular cell type. 
In other embodiments, the nucleic acids are at least 10, 

10 20, 30, 40, 50, 60, 70, 80, or 90 nucleotides in length. 
In yet other embodiments, the number of nucleotides in 
the nucleic acids is between 5-100 nucleotides, 15-100 
nucleotides,- .20-95 nucleotides, 25-90 nucleotides, 
35-85 nucleotides, 45-80 nucleotides, 50-75 nucle- 
us otides, or 55-70 nucleotides, inclusive. In still other em- 
bodiments, the number of nucleotides in the nucleic ac- 
ids is contained in one of the following ranges: 5-15 nu- 
cleotides, 15-20 nucleotides, 20-25 nucleotides, 25-35 
nucleotides, 35-45 nucleotides, 45-60 nucleotides, 

20 60-70 nucleotides, 70-80 nucleotides, 80-90 nucle- 
otides, or 90-1 00 nucleotides, inclusive. In other embod- 
iments, the nucleic acids contain less than 50,000; 
10,000; 5,000; or 2,000 nucleotides. A randomized nu- 
cleic acid library may be constructed in a number of 

25 ways. For example, it may be constructed from existing 
cDNA libraries. In one example, the cDNA libraries are 
shuffled using the "Gene Shuffling" technology of Max- 
ygen Corp. The cDNA sequences are amplified using 
inefficient PCR either by restricting elongation time or 

30 through the use of manganese. A library of recom- 
binants is created, and the library is finally amplified by 
PCR and cloned into vectors. In a second method, ex- 
isting cDNA libraries are digested with an endonuclease 
to generate fragments of 10 to 300 base pairs. Alterna- 

35 lively, the cDNA libraries are digested to generate short- 
er fragments of, for example, 5 to 50 base pairs, 5 to 40 
base pairs, 5 to 20 base pairs, 5 to 1 0 base pairs, or 1 0 
to 20 base pairs, inclusive. If the fragments are to con- 
tain 5' OH and 3' P0 4 groups, they are dephosphorylat- 

40 ed using alkaline phosphatase and phosphorylated us- 
ing polynucleotide kinase. These dsDNA fragments are 
then ligated to form larger molecules, and are size se- 
lected. In a third example, randomized nucleic acid li- 
braries are created by using random priming of cDNA 

45 libraries (using random hexamers and Klenow) to gen- 
erate short fragments of 20 to 100 nucleotides. Alterna- 
tively, shorter fragments are generated that contain, for 
example, 5 to 50 nucleotides, 5 to 40 nucleotides, 5 to 
20 nucleotides, 5 to 10 nucleotides, or 10 to 20 nucle- 

50 otides, inclusive. These fragments are then ligated ran- 
domly to give a desired sized larger fragment. 
[0057] Alternatively, a randomized nucleic acid library 
can be generated from random sequences of oligonu- 
cleotides. For example, DNA or RNA oligonucleotides 

55 may be prepared chemically. Random DNA sequences 
may also be prepared enzymatically using terminal 
transferase in the presence of all dNTPs. Random RNA 
molecules may be prepared using NDPs and NDP phos- 
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phorytase. The random sequences may be 10 to 300 
bases in length. Alternatively, shorter random sequenc- 
es are used that contain, for example, 5 to 50 bases, 5 
to 40 bases, 5 to 20 bases, 5 to 10 bases, or 10 to 20 
bases, inclusive. The sequences are ligated to form the 
desired larger sequence using RNA ligase. Alternatively 
these sequences may be ligated chemically. The oligo- 
nucleotides are phosphorylated at the 5' position using 
polynucleotide kinase or by chemical methods, prior to 
ligation enzymatically. Chemical ligations can utilize a 5' 
P0 4 and a 3* OH group or a5' OH and a 3' P0 4 group. 
[0058] Alternatively, a randomized nucleic acid library 
can be generated by converting the random DNA se- 
quences into dsDNA sequences using DNA polymerase 
(Klenow), dNTP and random heteromeric primers, and 
the RNA sequences are converted into dsDNA se- 
quences by reverse transcriptase and Klenow. After 
converting into DNA (ss or ds) the sequences are then 
amplified by PCR. The dsDNA fragments can also be 
ligated to give larger fragments of a desired size. 
[0059] The randomized nucleic acids may be cloned 
into a vector, for example, an expression vector, as a 
double stranded RNA transcription cassette. The se- 
quence of the nucleic acid may not be known at the time 
the vector is generated. The randomized nucleic acid 
may contain coding sequence or non-coding sequence, 
for example, a regulatory sequence (e.g., a transcription 
factor binding site, a promoter, or a 5' or 3' untranslated 
region (UTR)of an mRNA). 

[0060] By "Cre-mediated double recombination" is 
meant two nucleic acid recombination events involving 
/oxPsites that are mediated by Cre recombinase. A Cre- 
mediated double recombination event can occur, for ex- 
ample, as illustrated in Fig. 1 . 
[0061] By "function of a cell" is meant any cell activity 
that can be measured or assessed. Examples of cell 
function include, but are not limited to, cell motility, ap- 
optosis, cell growth, cell invasion, vascularization, cell 
cycle events, cell differentiation, cell dedifferentiation, 
neuronal cell regeneration, and the ability of a cell to 
support viral replication. The function of a cell may also 
be to affect the function, gene expression, or the 
polypeptide biological activity of another cell, for exam- 
ple, a neighboring cell, a cell that is contacted with the 
cell, or a cell that is contacted with media or other ex- 
tracellular fluid that the cell is contained in. 
[0062] By "apoptosis" is meant a cell death pathway 
wherein a dying cell displays a set of well-characterized 
biochemical hallmarks that include cytolemmal mem- 
brane blebbing, cell soma shrinkage, chromatin con- 
densation, nuclear disintegration, and DNA laddering. 
There are many well-known assays for determining the 
apoptotic state of a cell, including, and not limited to: 
reduction of MTT tetrazolium dye, TUNEL staining, An- 
nexin V staining, propidium iodide staining, DNA ladder- 
ing, PARP cleavage, caspase activation, and assess- 
ment of cellular and nuclear morphology. Any of these 
or other known assays may be used in the methods of 


the invention to determine whether a cell is undergoing 
apoptosis. 

[0063] By "polypeptide biological activity" is meant the 
ability of a target polypeptide to modulate ceil function. 

5 The level of polypeptide biological activity may be direct- 
ly measured using standard assays known in the art. 
For example, the relative level of polypeptide biological 
activity may be assessed by measuring the level of the 
mRNA that encodes the target polypeptide (e.g., by re- 

w verse transcription-polymerase chain reaction 
(RT-PCR) amplification or Northern blot analysis); the 
level of target polypeptide (e.g., by ELISA or Western 
blot analysis); the activity of a reporter gene under the 
transcriptional regulation of a target polypeptide tran- 

15 scriptional regulatory region (e.g., by reporter gene as- 
say, as described below); the specific interaction, of a 
target polypeptide with another molecule, for example, 
a polypeptide that is activated by the target polypeptide 
or that inhibits the target polypeptide activity (e.g., by 

20 the two-hybrid assay); or the phosphorylation or glyco- 
sylate state of the target polypeptide. A compound, 
such as a dsRNA, that increases the level of the target 
polypeptide, mRNA encoding the target polypeptide, or 
reporter geneactivity within a cell, a cell extract, or other 

25 experimental sample is a compound that stimulates or 
increases the biological activity of a target polypeptide. 
A compound, such as a dsRNA, that decreases the level 
of the target polypeptide, mRNA encoding the target 
polypeptide, or reporter gene activity within a cell, a cell 

30 extract, or other experimental sample is a compound 
that decreases the biological activity of a target polypep- 
tide. 

[0064] By "assaying" is meant analyzing the effect of 
a treatment, be it chemical or physical, administered to 
35 whole animals, cells, tissues, or molecules derived 
therefrom. The material being analyzed may be an an- 
imal, a cell, a tissue, a lysate or extract derived from a 
cell, or a molecule derived from a cell. The analysis may 
be, for example, for the purpose of detecting altered cell 
40 function, altered gene expression, altered endogenous 
RNA stability, altered polypeptide stability, altered 
polypeptide levels, or altered polypeptide biological ac- 
tivity. The means for analyzing may include, for exam- 
ple, antibody labeling, immunoprecipitation, phosphor- 
us ylation assays, glycosylation assays, and methods 
known to those skilled in the art for detecting nucleic 
acids. In some embodiments, assaying is conducted un- 
der selective conditions. 

[0065] By "modulates" is meant changing, either by a 
so decrease or an increase. As used herein, desirably a 
nucleic acid sequence decreases the function of a cell, 
the expression of a target nucleic acid in a cell, or the 
biological activity of a target polypeptide in a cell by least 
20%, more desirably by at least 30%, 40%, 50%, 60% 
55 or 75%, and most desirably by at least 90%. Also as 
used herein, desirably a nucleic acid sequence increas- 
es the function of a cell, the expression of a target nu- 
cleic acid in a cell, or the biological activity of a target 
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polypeptide in a cell by at least 1 .5-fold to 2-fold, more 
desirably by at least 3-fold, and most desirably by at 
least 5-fold. 

[0066] By "a decrease" is meant a lowering in the level 
of: a) protein (e.g., as measured by ELISA or Western 
blot analysis); b) reporter gene activity (e.g., as meas- 
ured by reporter gene assay, for example, p-galactosi- 
dase, green fluorescent protein, or luciferase activity); 
c) mRNA (e.g., as measured by RT-PCR or Northern 
blot analysis relative to an internal control, such as a 
"housekeeping" gene product, for example, p-actin or 
glyceraldehyde 3-phosphate dehydrogenase (GAP- 
DH)); or d) cell function, for example, as assayed by the 
number of apoptotic, mobile, growing, cell cycle arrest- 
ed, invasive, differentiated, or dedifferentiated cells in a 
test sample. In all cases, the lowering is desirably by at 
least 20%, more desirably by at least 30%, 40%, 50%, 
60%, 75%, and most desirably by at least 90%. As used 
herein, a decrease may be the direct or indirect result 
of PTGS. 

[0067] By "an increase" is meant a rise in the level of: 
a) protein (e.g., as measured by ELISA or Western blot 
analysis); b) reporter gene activity (e.g., as measured 
by reporter gene assay, for example, p-galactosidase, 
green fluorescent protein, or luciferase activity); c) mR- 
NA (e.g., as measured by RT-PCR or Northern blot anal- 
ysis relative to an internal control, such as a "house- 
keeping" gene product, for example, p-actin or glycer- 
" aldehyde 3-phosphate dehydrogenase (GAPDH)); or d) 
cell function, for example, as assayed by the number of 
apoptotic, mobile, growing, cell cycle arrested, invasive, 
differentiated, or dedifferentiated cells in a test sample. 
Desirably, the increase is by at least 1.5-fold to 2-fold, 
more desirably by at least 3-fold, and most desirably by 
at least 5-fold. As used herein, an increase may be the 
indirect result of PTGS. For example, the double strand- 
ed RNA may inhibit the expression of a protein, such as 
a suppressor protein, that would otherwise inhibit the ex- 
pression of another nucleic acid. 
[0068] By "alteration in the level of gene expression" 
is meant a change in transcription, translation, or mRNA 
or protein stability such that the overall amount of a prod- 
uct of the gene, i.e., mRNA or polypeptide, is increased 
or decreased. 

[0069] By "reporter gene" is meant any gene that en- 
codes a product whose expression is detectable and/or 
able to be quantitated by immunological, chemical, bio- 
chemical, or biological assays. A reporter gene product 
may, for example, have one of the following attributes, 
without restriction: fluorescence (e.g., green fluorescent 
protein), enzymatic activity (e.g., p-galactosidase, luci- 
ferase, chloramphenicol acetyltransf erase), toxicity (e. 
g. t ricin A), or an ability to be specifically bound by an 
additional molecule (e.g., an unlabeled antibody, fol- 
lowed by a labelled secondary antibody, or biotin, or a 
detectably labelled antibody). It is understood that any 
engineered variants of reporter genes that are readily 
available to one skilled in the art, are also included, with- 


out restriction, in the foregoing definition. 
[0070] By "protein" or "polypeptide" or "polypeptide 
fragment" is meant any chain of more than two amino 
acids, regardless of post-translational modification (e. 
5 g. t glycosylation or phosphorylation), constituting all or 
part of a naturally-occurring polypeptide or peptide, or 
constituting a non-naturally occurring polypeptide or 
peptide. 

[0071] By "promoter" is meant a minimal sequence 

10 sufficient to direct transcription of a gene. Also included 
. in this definition are those transcription control elements 
(e.g., enhancers) that are sufficient to render promoter- 
dependent gene expression controllable in a cell type- 
specific, tissue-specific, or temporal-specific manner, or 

15 that are inducible by external signals or agents; such 
elements, which are well-known to skilled artisans, may 
be found in a 5' or 3* region of a gene or within an intron. 
Desirably a promoter is operably linked to a nucleic acid 
sequence, for example, a cDNA or a gene in such a way 

20 as to permit expression of the nucleic acid sequence. 
[0072] By "operably linked" is meant that a gene and 
one or more transcriptional regulatory sequences, e.g., 
a promoter or enhancer, are connected in such a way 
as to permit gene expression when the appropriate mol- 

25 ecules (e.g., transcriptional activator proteins) are 
bound to the regulatory sequences. 
[0073] By "expression vector" is meant a DNA con- 
struct that contains at least one promoter operably 
linked to a downstream gene or coding region (e.g., a 

30 cDN A or genomic DNA fragment that encodes a protein, 
optionally, operatively linked to sequence lying outside 
a coding region, an antisense RNA coding region, or 
RNA sequences lying outside a coding region). Trans- 
fection or transformation of the expression vector into a 

35 recipient ceil allows the cell to express RNA encoded by 
the expression vector. An expression vector may be a 
genetically engineered plasmid, virus, or artificial chro- 
mosome derived from, for example, a bacteriophage, 
adenovirus, retrovirus, poxvirus, or herpesvirus. 

40 [0074] By "transformation" or "transfection" is meant 
any method for introducing foreign molecules into a cell 
(e.g., a bacterial, yeast, fungal, algal, plant, insect, or 
animal cell, particularly a vertebrate or mammalian cell). 
The cell may be in an animal. Lipofection, DEAE-dex- 

45 tran-mediated transfection, microinjection, protoplast 
fusion, calcium phosphate precipitation, viral or retrovi- 
ral delivery, electroporation, and biolistic transformation 
are just a few of the transformation/transfection meth- 
ods known to those skilled in the art. The RNA or RNA 

so expression vector (DNA) may be naked RNA or DNA or 
local anesthetic complexed RNA or DNA (Pachuk etai, 
supra). Other standard transformation/transfection 
methods and other RNA and/or DNA delivery agents (e. 
g., a cationic lipid, liposome, or bupivacaine) are de- 

55 scribed in WO 00/63364, filed April 19, 2000 (see, for 
example, pages 18-26). Commercially available kits can 
also be used to deliver RNA or DNA to a cell. For exam- 
ple, the Transmessenger Kit from Qiagen, an RNA kit 
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from Xeragon Inc., and an RNA kit from DNA Engine 
Inc. (Seattle, WA) can be used to introduce single or 
double stranded RNA into a ceil. 
[0075] By "transformed cell" or "transfected cell" is 
meant a cell (or a descendent of a cell) into which a nu- 
cleic acid molecule, for example, a double stranded 
RNA or double stranded expression vector has been in- 
troduced, by means of recombinant nucleic acid tech- 
niques. Such cells may be either stably or transiently 
transfected. 

[0076] By "selective conditions" is meant conditions 
under which a specific cell or group of cells can be se- 
lected for. For example, the parameters of a fluores- 
cence-activated cell sorter (FACS) can be modulated to 
identify a specific cell or group of cells. Cell panning, a 
technique known to those skilled in the art, is another 
method that employs selective conditions. 
[0077] As use herein, by "optimized" is meant that a 
nucleic acid fragment is generated through inefficient 
first strand synthesis {e.g., reverse transcription (RT) 
and/or RT/second strand synthesis (RT-SSS) using Kle- 
now or other enzymes and/or RT-PCR or PCR, to be of 
a particular length. Desirably the length of the nucleic 
acid fragment is less than a full length cDNA or is 100, 
500, 600, or 1000 nucleotides in length. In other embod- 
iments, the nucleic acid fragment is at least 10, 20, 30, 
40, 50, 60, 70, 80, or 90 nucleotides in length. In yet 
other embodiments, the number of nucleotides in the 
nucleic acid fragment is betwee~n"5-100~nucleotides, 
15-100 nucleotides, 20-95 nucleotides, 25-90 nucle- 
otides, 35-85 nucleotides, 45-80 nucleotides, 50-75 nu- 
cleotides, or 55-70 nucleotides, inclusive. In still other 
embodiments, the number of nucleotides in the nucleic 
acid fragment is contained in one of the following rang- 
es: 5-15 nucleotides, 15-20 nucleotides, 20-25 nucle- 
otides, 25-35 nucleotides, 35-45 nucleotides, 45-60 nu- 
cleotides, 60-70 nucleotides, 70-80 nucleotides, 80-90 
nucleotides, or 90-100 nucleotides, inclusive. In other 
embodiments, the nucleic acid fragment contains less 
than 50,000; 10,000; 5,000; or 2,000 nucleotides. Opti- 
mization of the length of a nucleic acid can be achieved 
during first strand or second strand synthesis of a de- 
sired nucleic acid by lowering Mg ++ concentrations to 
no less than the nucleotide concentrations; by adding 
Mn ++ to the reaction to achieve the desired size selec- 
tion (e.g., by replacing Mg + + completely, or by adding 
Mn ++ at varying concentrations along with Mg++); by de- 
creasing and/or limiting concentrations of dNTP(s) to ef- 
fect the desired fragment size; by using various concen- 
trations of ddNTP(s) along with standard or optimal con- 
centrations of dNTP(s), to achieve varying ratios, to ob- 
tain the desired fragment size; by using limited and con- 
trolled exonuclease digestion of the fragment following 
RT, RT-SSS, RT-PCR, or PCR; or by a combination of 
any of these methods. 

[0078] As used herein, by "sized selected" is meant 
that a nucleic acid of a particular size is selected for use 
in the construction of dsRNA expression libraries as de- 


scribed herein. Desirably the size selected nucleic acid 
is less than a full length cDNA sequence or at least 100, 
500, 600, or 1 000 nucleotides in length. In other embod- 
iments, the nucleic acid is at least 10, 20, 30, 40, 50, 60, 

5 70, 80, or 90 nucleotides in length. In yet other embod- 
iments, the number of nucleotides in the nucleic acid is 
between 5-100 nucleotides, 15-100 nucleotides, 20-95 
nucleotides, 25-90 nucleotides, 35-85 nucleotides, 
45-80 nucleotides, 50-75 nucleotides, or 55-70 nucle- 

10 otides, inclusive. In still other embodiments, the number 
of nucleotides in the nucleic acid is contained in one of 
the following ranges: 5-15 nucleotides, 15-20 nucle- 
otides, 20-25 nucleotides, 25-35 nucleotides, 35-45 nu- 
cleotides, 45-60 nucleotides, 60-70 nucleotides, 70-80 

is nucleotides, 80-90 nucleotides, or 90-100 nucleotides, 
inclusive. In other embodiments, the nucleic acid con- 
tains less than 50,000; 10,000; 5,000; or 2,000 nucle- 
otides. For example, a nucleic acid may be size selected 
using size exclusion chromatography (e.g., as size ex- 

20 : elusion Sepharose matrices) according to standard pro- 
cedures (see, for example, Sambrook, Fritsch, and Ma- 
niatis, Molecular Cloning: A Laboratory Manual (2d ed.), 
Cold Spring Harbor Laboratory Press, Cold Spring Har- 
bor, NY, 1 989). 

25 [0079] By "under conditions that inhibit or prevent an 
interferon response or a dsRNA stress response" is 
meant conditions that prevent or inhibit one or more in- 
terferon responses or cellular RNA stress responses in- 
" voiving cell toxicity, cell death, an anti-prol iterative re- 

30 sponse, or a decreased ability of a dsRNA to carry out 
a PTGS event. These responses include, but are not 
limited to, interferon induction (both Type 1 and Type II), 
induction of one or more interferon stimulated genes, 
PKR activation, 2'5'-OAS activation, and any down- 

35 stream cellular and/or organismal sequelae that result 
from the activation/induction of one or more of these re- 
sponses. By "organismal sequelae" is meant any effect 
(s) in a whole animal, organ, or more locally (e.g., at a 
site of injection) caused by the stress response. Exem- 

40 piary manifestations include elevated cytokine produc- 
tion, local inflammation, and necrosis. Desirably the 
conditions that inhibit these responses are such that not 
more than 95%, 90%, 80%, 75%, 60%, 40%, or 25%, 
and most desirably not more than 10% of the cells un- 

45 dergo cell toxicity, cell death, or a decreased ability to 
carry out a PTGS event, compared to a cell not exposed 
to such interferon response inhibiting conditions, all oth- 
er conditions being equal (e.g., same cell type, same 
transformation with the same dsRNA expression library. 

so [0080] Apoptosis, interferon induction, 2 , 5' OAS acti- 
vation/induction, PKR induction/activation, antiprolifer- 
ative responses, and cytopathic effects are all indicators 
for the RNA stress response pathway. Exemplary as- 
says that can be used to measure the induction of an 

55 RNA stress response as described herein include a 
TUNEL assay to detect apoptotic cells, ELISA assays 
to detect the induction of alpha, beta and gamma inter- 
feron, ribosomal RNA fragmentation analysis to detect 
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activation of 2'5'OAS, measurement of phosphorylated 
elF2a as an indicator of PKR (protein kinase RNA in- 
ducible) activation, proliferation assays to detect chang- 
es in cellular proliferation, and microscopic analysis of 
cells to identify cellular cytopathic effects (see, e.g., Ex- 
ample 1 1 ). Desirably, the level of an interferon response 
or a dsRNA stress response in a cell transformed with 
a double stranded RNA or a double straned RNA ex- 
pression vector is less than 20, 10, 5,or 2-fold greater 
than the corresponding level in a mock-transfected con- 
trol cell under the same conditions, as measured using 
one of the assays described herein. In other embodi- 
ments, the level of an interferon response or a dsRNA 
stress response in a cell transformed with a double 
stranded RNA or a double straned RNA expression vec- 
tor using the methods of the present invention is less 
than 500%, 200%, 100%, 50%, 25%, or 10% greater 
than the corresponding level in a corresponding trans- 
formed cell that is not exposed to such interferon re- 
sponse inhibiting conditions, all other conditions being 
equal. Desirably, the double straned RNA does not in- 
duce a global inhibition of cellular transcription or trans- 
lation. 

[0081 ] By □ specifically hybridizes □ is meant a dou- 
ble stranded RNA that hybridizes to a target nucleic acid 
but does not substantially hybridize to other nucleic ac- 
ids in a sample (e.g., a sample from a cell) that naturally 
includes the target nucleic acid, when assayed under 
denaturing conditions. In one embodiment, the amount 
of a target nucleic acidhybridized to, or associated with, 
the double stranded RNA, as measured using standard 
assays, is 2-fold, desirably 5-fold, more desirably 
10-fold, and most desirably 50-fold greater than the 
amount of a control nucleic acid hybridized to, or asso- 
ciated with, the double stranded RNA. 
[0082] By □ high stringency conditions □ is meant hy- 
bridization in 2X SSC at 40°C with a DNA probe length 
of at least 40 nucleotides. For other definitions of high 
stringency conditions, see F. Ausubel etal., Current Pro- 
tocols in Molecular Biology, pp. 6.3.1-6.3.6, John Wiley 
& Sons, New York, NY, 1994, hereby incorporated by 
reference. 

[0083] Conditions and techniques that can be used to 
prevent an interferon response or dsRNA stress re- 
sponse during the screening methods of the present in- 
vention are described herein. 

Brief Description of the Drawing 

[0084] Fig. 1 is a schematic representation of a strat- 
egy to isolate clonally pure stable integrants that contain 
a single expression unit isolated from cells transfected 
with a double-stranded RNA encoding a cDNA library. 
[0085] Fig. 2 is a schematic illustration of the produc- 
tion of effector RNAs in cells expressing PSA. The PSA 
expression cassette used to create the transient PSA 
expression cell line is depicted at the top of the figure. 
Expression of PSA is driven by the HCMV IE promoter 


and the SV40 polyadenylation signal (pA). Only se- 
quences 3* of the PSA initiation codon have been used 
in these vectors. The effector RNA expression cassettes 
are shown below the PSA expression cassette and are 

5 designed to express PSA sense RNA, PSA antisense 
RNA, and PSA dsRNA. Expression of the effector RNAs 
is under the control of the T7 promoter (T7p). Transcrip- 
tion from T7p is catalyzed by T7 RNA polymerase, which 
is supplied by co-transfecting a T7 RNA polymerase ex- 

10 pression plasmid (not shown). Control effector RNA 
cassettes expressing irrelevant RNAs derived from the 
Herpes simplex virus glycoprotein D gene were included 
as controls. The 600 base pair sequence from the Her- 
. pes simplex gD gene is from Herpes Simplex virus 2 

15 strain 1 2 and maps to the coding region downstream of 
the gD initiation codon. 

[0086] Fig. 3 is a bar graph illustrating silencing of 
PSA expression by dsRNA. PSA levels in the super- 
nates of transfected cells were determined by ELISA 

20 and are plotted as percent expression of the PSA un- 
treated control. The PSA untreated control shown at the 
left is normalized to 1 00%. PSA levels in the supernates 
of cells transfected with the various effector PSA or con- 
trol RNAs are shown by the shaded and open bars re- 

25 spectively. All data shown is from day two post-transf ec- 
tion. Data from later time points were similar to the day 
two time point. 

[0087] Fig. 4 is a schematic illustration of the RNA 
stress response pathway, also known as the Type 1 in- 
30 terferon response. 

Detailed Description of the Invention 

[0088] Post-transcriptional gene silencing (PTGS) 
35 can be used as a tool to identify and validate specific 
unknown genes involved in cell function, gene expres- 
sion, and polypeptide biological activity. Although the 
use of PTGS as a validation strategy is well documented 
in invertebrates and plants, its use in identification of 
40 genes that modulate cell function, gene expression, or 
polypeptide biological activity, as described below, is 
novel. Since novel genes are likely to be identified 
through the methods of the present invention, PTGS is 
developed for use in validation and to identify novel tar- 
45 gets for use in therapies for diseases, for example, can- 
cer, neurological disorders, obesity, leukemia, lympho- 
mas, and other disorders of the blood or immune sys- 
tem. 

[0089] The present invention features methods to 
50 identify unknown targets that result in the modulation of 
a particular phenotype, an alteration of gene expression 
in a cell, or an alteration in polypeptide biological activity 
in a cell, using either a library based screening approach 
or a non-library based approach to identify nucleic acids 
55 that induce gene silencing. The present invention also 
allows the determination of function of a given se- 
quence. These methods involve the direct delivery of in 
vitro transcribed double stranded RNA (dsRNA), as well 
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as plasmid-based systems that direct the cell to make 
its own dsRNA. To avoid problems associated with 
transfection efficiency, plasm ids are designed to contain 
a selectable marker to ensure the survival of only those 
cells that have taken up plasmid DNA. One group of 
plasmids directs the synthesis of dsRNA that is tran- 
scribed in the cytoplasm, while another group directs the 
synthesis of dsRNA that is transcribed in the nucleus. 

Identification of genes by assaying for a modulation in 
cell function 

[0090] Functional identification of novel genes can be 
accomplished through the use of a number of different 
assays. For example, cells may be assayed for cell mo- 
tility, apoptosis, cell growth, cell invasion, vasculariza- 
tion, cell cycle events, cell differentiation, cell dediffer- 
entiation, neuronal cell regeneration, or the ability to 
support viral replication, as well as other cell functions 
known in the art. Methods for carrying out such function- 
al assays are well known and are described, for exam- 
ple, in Platet and Garcia (Invasion Metastasis 18: 
198-208, 1998-1999); Harper et al. (Neuroscience 88: 
257-267, 1999); and Tomaselli etal. (J. Cell Biol. 105: 
2347-2358, 1987), and are also described below. 
[0091] Functional identification of nucleic acid se- 
quences involved in modulating a particular cell function 
may be carried out by comparing cells transfected with 
a dsRNA to control cells that have not been transformed 
with a dsRNA or that have been mock-transfected, in a 
functional assay. A cell that has taken up sequences un- 
related to a particular function will perform in the partic- 
ular assay in a manner similar to the control cell. A cell 
experiencing PTGS of a gene involved in the particular 
function will exhibit an altered ability to perform in the 
functional assay compared to the control. 
[0092] The percent modulation of a particular cell 
function that identifies a nucleic acid sequence that 
modulates the function of a cell will vary depending on 
the assay, phenotype, and the particular nucleic acid af- 
fected by PTGS. For each assay, the percent modula- 
tion can readily be determined by one skilled in the art, 
when used in conjunction with controls, as described 
herein. Desirably the modulation is at least 20%, more 
desirably at least 30%, 40%, 50%, 60%, 75%, and most 
desirably at least 90% compared to the control. An in- 
crease in the function of a cell can also be measured in 
terms of fold increase, where desirably, the increase is 
at least 1 .5-fold to 5-fold compared to the control 
[0093] Alternatively, the function of a cell may be to 
affect the function, gene expression, or polypeptide bi- 
ological activity of another cell, for example, a neighbor- 
ing cell, a cell that is contacted with the cell in which a 
PTGS event occurs, or a cell that is contacted with me- 
dia or other extracellular fluid that the cell in which a 
PTGS event occurs is contained in. For example, a cell 
experiencing PTGS of a gene may modulate cell motil- 
ity, apoptosis, cell growth, cell invasion, vascularization, 


cell cycle events, cell differentiation, cell dedifferentia- 
tion, neuronal cell regeneration, or the ability to support 
viral replication of a nearby cell, or a cell that is exposed 
to media or other extracellular fluid in which the trans- 

5 fected cell in which a PTGS event occurs was once con- 
tained. This can be tested by removing the media in 
which a cell experiencing a PTGS event is occurring and 
placing it on a separate cell or population of cells. If the 
function of the separate cell or population of cells is mod- 

10 ulated, compared to a cell or population of cells receiv- 
ing media obtained from cells that had been mock trans- 
fected, then one or more of the cells experiencing a 
PTGS event can affect the function of another cell. The 
identity of the nucleic acid sequence that causes the 

is modulation can be identified with repeated rounds of se- 
lection. 

[0094] In another method, a single cell experiencing 
a PTGS event can be placed in proximity of a cell or a 
population of cells that was not transfected with dsRNA, 

20 and the effect of this placement is evaluated for a mod- 
ulation in the function of the cell or population of ceils. 
If the function of the non-transfected cell or population 
of cells is modulated, compared to a cell or population 
of cells in proximity of a cell that was mock transfected, 

25 then the cell experiencing a PTGS event contains a nu- 
cleic acid sequence that can affect the function of an- 
other cell. This nucleic acid sequence can be identified 
using techniques described herein. 

30 Identification of genes using differential gene 
expression 

[0095] Differential gene expression analysis can be 
used to identify a nucleic acid sequence that modulates 
35 the expression of a target nucleic acid in a cell. Altera- 
tions in gene expression induced by gene silencing can 
be monitored in a cell Into which a dsRNA has been in- 
troduced. For example, differential gene expression can 
be assayed by comparing nucleic acids expressed in 
cells into which dsRNA has been introduced to nucleic 
acids expressed in control cells that were not transfect- 
ed with dsRNA or that were mock-transfected. Gene ar- 
ray technology can be used in order to simultaneously 
examine the expression levels of many different nucleic 
45 acids. Examples of methods for such expression anal- 
ysis are described by Marrack et al. (Current Opinions 
in Immunology 12:206-209, 2000); Harkin (Oncologist 
5:501 -507, 2000); Pelizzari etal. (Nucleic Acids Res. 28: 
4577-4581, 2000); and Marx (Science 289:1670-1672, 
so 2000). 

Identification of genes by assaying polypeptide 
biological activity 

55 [0096] Novel nucleic acid sequences that modulate 
the biological activity of a target polypeptide can also be 
identified by examining polypeptide biological activity. 
Various polypeptide biological activities can be evaluat- 
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ed to identify novel genes according to the methods of 
the invention. For example, the expression of a target 
polypeptide(s) may be examined. Alternatively, the in- 
teraction between a target polypeptide(s) and another 
molecule(s), for example, another polypeptide or a nu- 
cleic acid may be assayed. Phosphorylation or glyco- 
syiation of a target polypeptide(s) may also be as- 
sessed, using standard methods known to those skilled 
in the art. 

[0097] Identification of nucleic acid sequences in- 
volved in modulating the biological activity of a target 
polypeptide may be carried out by comparing the 
polypeptide biological activity of a cell transfected with 
a dsRNA to a control cell that has not been transfected 
with a dsRNA or that has been mock-transfected. A cell 
that has taken up sequences unrelated to a particular 
polypeptide biological activity will perform in the partic- 
ular assay in a manner similar to the control cell. A cell 
experiencing PTGS of a gene involved in the particular 
polypeptide biological activity will exhibit an altered abil- 
ity to perform in the biological assay, compared to the 
control. 

Insertion of single units into the chromosome and 
generation of a ceil line containing a single dsRNA 
expression library integrant 

[0098] The present invention involves the generation 
of a target cell line in which the dsRNA expression library 
is subsequently introduced. Through the use of site-spe- 
cific recombination, single integrants of dsRNA expres- 
sion cassettes are generated at the same locus of all 
cells in the target cell line, allowing uniform expression 
of the dsRNA in all of the integrants. A dsRNA expres- 
sion library derived from various cell lines is used to cre- 
ate a representative library of stably integrated cells, 
each cell within the target cell line containing a single 
integrant. Cre//ox, Lambda-Cro repressor, and Flp re- 
combinase systems or retroviruses are used to gener- 
ate these singular integrants of dsRNA expression cas- 
settes in the target cell line (Satoh et a/., J. Virol. 74: 
10631-10638, 2000; Trinh era/., J. Immunol. Methods 
244:185-193, 2000; Serov era/., An. Acad. Bras. Cienc. 
72:389-398, 2000; Grez era/., Stem Cells. 16:235-243, 
1 998; Habu etal., Nucleic Acids Symp. Ser. 42:295-296, 
1999; Haren etal., Annu. Rev. Microbiol. 53:245-281, 
1999; Baer et a/., Biochemistry 39:7041-7049, 2000; 
Follenzi et a/., Nat. Genet. 25:217-222, 2000; Hind- 
marsh et a/., Microbiol. Mol. Biol. Rev. 63:836-843, 
1999; Darquet et a/. f Gene Ther. 6:209-218, 1999; Dar- 
quet etal., Gene Ther. 6:209-218, 1999; Yu era/., Gene 
223:77-81, 1998; Darquet et a/., Gene Ther. 4: 
1341-1349, 1997; and Koch etal., Gene 249:135-144, 
2000). These systems are used singularly to generate 
singular insertion clones, and also in combination. 
[0099] The following exemplary sequence specific in- 
tegrative systems use short target sequences that allow 
targeted recombination to be achieved using specific 


proteins: FLP recombinase, bacteriophage Lambda in- 
tegrase, HIV integrase, and pilin recombinase of Salmo- 
nella (Seng et at. Construction of a Flp "exchange cas- 
sette" contained vector and gene targeting in mouse ES 

5 cell] A book chapter PUBMED entry 11 797223 - Sheng 
Wu Gong Cheng Xue Bao. 2001 Sep;17(5):566-9., Liu 
etal., Nat Genet. 2001 Jan 1;30(1):66-72., Awatramani 
etal., Nat Genet. 2001 Nov;29(3):257-9. t Heidmannand 
Lehner, Dev Genes Evol. 2001 Sep;211(8-9):458-65, 

10 Schaft et a/. t Genesis. 2001 Sep;31(1):6-10, Van 
Duyne, Annu Rev Biophys Biomol Struct. 2001 ;30: 
87-104., Lorbach era/., J Mol Biol. 2000 Mar 10;296(5): 
1175-81., Darquet etal., Gene Ther. 1999 Feb;6(2): 
209-18., Bushman and Miller, J Virol. 1997 Jan;71(1): 

15 458-64., Fulks et a/., J Bacteriol. 1990 Jan;172(1): 
31 0-6). A singular integrant is produced by randomly in- 
serting the specific sequence (e.g., loxP in the ere re- 
combinase system) and selecting or identifying the cell 
that contains a singular integrant that supports maximal 

20 expression. For example, integrants that show maximal 
expression following random integration can be identi- 
fied through the use of reporter gene sequences asso- 
ciated with the integrated sequence. The cell can be 
used to specifically insert the expression cassette into 

25 the site that contains the target sequence using the spe- 
cific recombinase, and possibly also remove the expres- 
sion cassette that was originally placed to identify the 
maximally expressing chromosomal location. A skilled 
artisan can also produce singular integrants using ret- 

30 roviral vectors, which integrate randomly and singularly 
into the eukaryotic genome. In particular, singular inte- 
grants can be produced by inserting retroviral vectors 
that have been engineered to contain the desired ex- 
pression cassette into a naive cell and selecting for the 

35 chromosomal location that results in maximal expres- 
sion (Michael et ai, EMBO Journal, vol 20: pages 
2224-2235, 2001; Reik and Murrell., Nature, vol. 405, 
page 408-409, 2000; Berger et ai, Molecular Cell, vol. 
8, pages 263-268). One may also produce a singular 

40 integrant by cotransfecting the bacterial RecA protein 
with or without nuclear localization signal along with se- 
quences that are homologous to the target sequence (e. 
g„ a target endogenous sequence or integrated trans- 
gene sequence). Alternatively, a nucleic acid sequence 

45 that encodes a RecA protein with nuclear localization 
signals can be cotransfected (Shibata et a/. f Proc Natl 
Acad Sci USA. 2001 Jul 17;98(15):8425-32. Review., 
Muyrers er a/., Trends Biochem Sci. 2001 May;26(5): 
325-31 ., Paul era/., MutatRes. 2001 Jun5;486(1):11-9., 

50 Shcherbakova et a/., Mutat Res. 2000 Feb 16;459(1): 
65-71., Lantsov. Mol Biol (Mosk). 1994 May-Jun;28(3): 
485-95). 

[0100] An example utilizing such methods is detailed 
below. 

55 

Creation of the target cell line 

[0101] Target cell lines are the same cell lines as the 
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ones from which the dsRNA expression libraries will be 
derived. Target cells are created by transfecting the se- 
lected cell line with a bicistronic plasmid expressing a 
selectable marker, such as G418 and the reporter gene 
GFP. The plasmid also bears a loxP site. Plasmids inte- 
grate randomly into the chromosome through the proc- 
ess of illegitimate recombination at a frequency of 10* 4 . 
Following transfection, cells containing integrants are 
selected by culturing the cells in the presence of G418 
at a concentration determined earlier in a kill curve anal- 
ysis. About a dozen G418-resistant colonies are ex- 
panded and relative GFP expression levels are deter- 
- mined using flow cytometry. DNA from the-cells is ana- 
lyzed by Southern blot analysis to determine integrant 
copy number. Several single copy integrants exhibiting 
the highest GFP expression levels are then selected as 
the target cell lines. GFP expression is monitored be- 
cause dsRNA encoding templates are then integrated 
into the loci containing the loxP, GFP, and G418 cas- 
settes in a site-specific fashion, and it is important to 
ensure that these loci are transcriptionally active. Since 
cells are selected on the basis of G41 8 resistance and 
GFP expression, integration of the plasmid DNA can oc- 
cur at the loxP site, destroying its function. Several cell 
lines are therefore chosen to reasonably ensure that at 
least one integrant has an intact loxP site. 

Double stranded RNA expression library construction 
and site-specific recombination into the target cell line 

[0102] A cDNA library or a randomized library is con- 
structed from RNA isolated from selected cell lines. cD- 
NAs or randomized nucleic acids in the size range of at 
least 100 to 1000 nucleotides, for example, 500 to 600 
nucleotides are optimized during synthesis or are size- 
selected prior to cloning. In other embodiments, the nu- 
cleic acids are at least 10, 20, 30, 40, 50, 60, 70, 80, or 
90 nucleotides in length. In yet other embodiments, the 
. number of nucleotides in the nucleic acids is between 
5-100 nucleotides, 15-100 nucleotides, 20-95 nucle- 
otides, 25-90 nucleotides, 35-85 nucleotides, 45-80 nu- 
cleotides, 50-75 nucleotides, or 55-70 nucleotides, in- 
clusive. In still other embodiments, the number of nucle- 
otides in the nucleic acids is contained in one of the fol- 
lowing ranges: 5-15 nucleotides, 15-20 nucleotides, 
20-25 nucleotides, 25-35 nucleotides, 35-45 nucle- 
otides, 45-60 nucleotides, 60-70 nucleotides, 70-80 nu- 
cleotides, 80-90 nucleotides, or 90-100 nucleotides, in- 
clusive. In other embodiments, the nucleic acid contains 
less than 50,000; 10,000; 5,000; or 2,000 nucleotides. 
Each cDNA or randomized nucleic acid is then cloned 
into a plasmid vector as a dsRNA transcription cassette 
flanked by two convergent promoters (such as T7 pro- 
moters as described herein). The promoters are tran- 
scriptionally regulated such that they are off until in- 
duced, for example, using a tet ON/OFF system (Forster 
ef a/., Nucleic Acids Res. 27:7708-710, 1999; Liu etal., 
Biotechniques 24:624-628, 6,30-632, 1998; and Gatz, 


Methods Cell Biol. 50:411-424, 1995). The plasmid also 
contains the hygromycin resistance gene and an invert- 
ed loxP site. The cDNA plasmid library or randomized 
plasmid library is then co-transfected into the target cell 

5 line with a plasmid expressing Cre recombinase, which 
catalyzes site-specific recombination of the transfected 
cDNA plasmid or randomized nucleic acid plasmid at the 
inverted loxPsiie into the chromosomal locus containing 
the GFP gene and loxP site (see Fig. 1). The use of the 

10 Oxellox system allows the efficient integration of a plas- 
mid into the chromosome (every transfected cell is pre- 
dicted to undergo a plasmid integration event). Other 
site-specific recombination strategies can also be uti- 
lized. This results in having every integration to occur at 

is the same site, thereby obviating potential problems with 
loci dependent expression. 

[01 03] Two days following transfection, cells are incu- 
bated in the presence of hygromycin to kill untransf ected 
cells and to select for stable integrants. Transcription of 

20 dsRNA is induced, and selected cells are assayed for 
an alteration in cell function, the biological activity of a 
target polypeptide, or differential gene expression. Cells 
expressing dsRNA corresponding to a target nucleic ac- 
id exhibit an altered function, for example, increased or 

25 decreased cell invasion, motility, apoptosis, growth, dif- 
ferentiation, dedifferentiation, or regeneration, or the 
ability of the cell to support viral replication. Cells exhib- 
iting altered function are then expanded and the se- 
quence of the integrant is determined. Targets are iden- 

30 tified and validated using dsRNA specific for the identi- 
fied target, or other non-PTGS mediated methods, for 
example antisense technology. 
[0104] The regulated transcription system of the 
present invention provides an advantage to other dou- 

35 ble stranded expression systems. Following transfec- 
tion of the dsRNA library, cells contain hundreds to thou- 
sands of dsRNA expression cassettes, with concomi- 
tant expression of that many expression cassettes. In 
the dsRNA expression system of the present invention, 

40 dsRNA expression cassettes contained within the ex- 
pression vector integrate into the chromosome of the 
transfected cell. As described in detail below, every 
transfected cell integrates one of the double stranded 
expression cassettes. Desirably no transcription occurs 

45 until the episomal (non-integrated) expression vectors 
are diluted out of the cell such that not more than 5 ep- 
isomal vectors remain in the cell. Most desirably no tran- 
scription occurs until all of the episomal (non-integrated) 
expression vectors are diluted out of the cell and only 

so the integrated expression cassette remains (a process 
usually taking about two to several weeks of cell cul- 
ture). At this time transcription is induced, allowing dsR- 
NA to be expressed in the ceils. This method ensures 
that only one species of dsRNA is expressed per cell, 

55 as opposed to other methods that express hundreds to 
thousands of double stranded species. The use of the 
above-described system results in the loss of ail but one 
expression cassette, which in turn, permits the rapid 
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screening of libraries without requiring screening multi- 
ple pools of libraries to identify the target gene. 

Non-library approaches for the identification of a nucleic 
acid sequence that modulates cell function, gene 
expression in a cell, or the biological activity of a target 
polypeptide in a cell through PTGS 

[0105] Nucleic acid sequences that modulate cell 
function, gene expression in a cell, or the biological ac- 
tivity of a target polypeptide in a cell may also be iden- 
tified using non-library based approaches involving 
PTGS. For example, a single known nucleic acid se- 
quence encoding a polypeptide with unknown function 
or a single nucleic acid fragment of unknown sequence 
and/or function can be made into a double stranded 
RNA molecule. This dsRNA is then transfected into a 
desired cell type and the cell is assayed for modulations 
in cell function, gene expression of a target nucleic acid 
in the cell, or the biological activity of a target polypep- 
tide in the cell, using methods described herein. A mod- 
ulation in cell function, gene expression in the cell, or 
the biological activity of a target polypeptide in the cell 
identifies the nucleic acid of the dsRNA as a nucleic acid 
the modulates the specific cell function, gene expres- 
sion, or the biological activity of a target polypeptide. As 
a single dsRNA species is transfected into the cells, the 
nucleic acid sequence responsible for the modulation is 
readily identified. This non-library based approach to 
nucleic acid identification is desirably used under con- 
ditions that inhibit an interferon response or dsRNA 
stress response. Such conditions are described in detail 
herein. 

[0106] The discovery of novel genes through the 
methods of the present invention may lead to the gen- 
eration of novel therapeutics. For example, genes that 
decrease cell invasion may be used as targets for drug 
development, such as for the development of cytostatic 
therapeutics for use in the treatment of cancer. Devel- 
opment of such therapeutics is important because cur- 
rently available cytotoxic anticancer agents are also tox- 
ic for normal rapidly dividing cells. In contrast, a cyto- 
static agent may only need to check metastatic process- 
es, and by inference, slow cell growth, in order to stabi- 
lize the disease. In another example, genes that in- 
crease neuronal regeneration may be used to develop 
therapeutics for the treatment, prevention, or control of 
a number of neurological diseases, including Alzheim- 
er's disease and Parkinson's disease. Genes that are 
involved in the ability to support viral replication and be 
used as targets in anti-viral therapies. Such therapies 
may be used to treat, prevent, or control viral diseases 
involving human immunodeficiency virus (HIV), hepati- 
tis C virus (HCV), hepatitis B virus (HBV), and human 
papillomavirus (HPV). The efficacies of therapeutics tar- 
geting the genes identified according to the present in- 
vention can be further tested in cell culture assays, as 
well as in animal models. 


The use of vertebrate or mammalian cell lines for 
identification of nucleic acid sequences that modulate 
cell function, expression of a target nucleic acid or 
biological activity of a target polypeptide 

5 

[0107] While the use of the present invention is not 
limited to vertebrate or mammalian cells, such cells can 
be used to carry out the nucleic acid identification meth- 
ods described herein. Desirably the vertebrate or mam- 

10 malian cells used to carry out the present invention are 
cells that have been cultured for only a small number of 
passages (e.g., less than 30 passages of a cell line that 
has been obtained directly from American Type Culture 
Collection), or are primary cells. In addition, vertebrate 

is or mammalian cells can be used to carry out the present 
invention when the dsRNA being transfected into the 
cell is not complexed with cationic lipids/ 
[01 08] The following examples are to illustrate the in- 
vention. They are not meant to limit the invention in any 

20 way. For example, it is noted that any of the following 
examples can be used with double stranded RNAs of 
any length. The methods of the present invention can 
be readily adapted by one skilled in the art to utilize dou- 
ble stranded RNAs of any desired length. 

25 

Example 1 : Design and delivery of vectors for 
intracellular synthesis of dsRNA for library based 
screening approaches to nucleic acid identification 
using PTGS 

30 

[0109] PTGS is induced when dsRNA is made intra- 
cellularly. The library based screening approaches to 
nucleic acid identification through PTGS may require 
that dsRNA reside in certain cellular compartments in 
35 order to exert its effect. Therefore, expression plasmids 
that transcribe dsRNA in the cytoplasm and in the nu- 
cleus are utilized. There are two classes of nuclear tran- 
scription vectors: one that is designed to express poly- 
adenylated dsRNA (for example, a vector containing an 
40 RNA polymerase II promoter and a poly A site) and one 
that expresses non-adenylated dsRNA (for example, a 
vector containing an RNA polymerase II promoter and 
no poly A site, or a vector containing a T7 promoter). 
Different cellular distributions are predicted for the two 
45 species of RNA; both vectors are transcribed in the nu- 
cleus, but the ultimate destinations of the RNA species 
are different intracellular locations. Intracellular tran- 
scription may also utilize bacteriophage T7 and SP6 
RNA polymerase, which may be designed to transcribe 
so in the cytoplasm or in the nucleus. Alternatively, Qbeta 
replicase RNA-dependent RNA polymerase may be 
used to amplify dsRNA. Viral RNA polymerases, either 
DNA and RNA dependent, may also be used. Alterna- 
tively, dsRNA replicating polymerases can be used. Cel- 
55 lular polymerases such as RNA Polymerase I, II, or III 
or mitochondrial RNA polymerase may also be utilized. 
Both the cytoplasmic and nuclear transcription vectors 
contain an antibiotic resistance gene to enable selection 


35 


40 


17 


33 


EP 1 229 134 A2 


34 


of cells that have taken up the plasmid. Cloning strate- 
gies employ chain reaction cloning (CRC), a one-step 
method for directional ligation of multiple fragments (Pa- 
chuk etat., Gene 243:19-25, 2000). Briefly, the ligations 
utilize bridge oligonucleotides to align the DNA frag- 
ments in a particular order and ligation is catalyzed by 
a heat-stable DNA ligase, such as Amptigase, available 
from Epicentre. 

Inducible or repressible transcription vectors for the 
generation of a dsRNA expression library 

_ [01 10] If desired, inducible and repressible transcrip- 
tion systems can be used to control the timing of the 
synthesis of dsRNA. Inducible and repressible regula- 
tory systems involve the use of promoter elements that 
contain sequences that bind prokaryotic or eukaryotic 
transcription factors upstream of the sequence encod- 
ing dsRNA. In addition, these factors also carry protein 
domains that transactivate or transrepress the RNA 
polymerase II. The regulatory system also has the ability 
to bind a small molecule (e.g., a coinducer or a com- 
pressor). The binding of the small molecule to the reg- 
ulatory protein molecule (e.g., a transcription factor) re- 
sults in either increased or decreased affinity for the se- 
quence element. Both inducible and repressible sys- 
tems can be developed using any of the inducer/tran- 
scription factor combinations by positioning the binding 
site appropriately with respect to the promoter se- 
quence. Examples of previously described inducible/re- 
pressible systems include lacl, ara, Steroid-RU486, and 
ecdysone - Rheogene, Lac (Cronin et al. Genes & De- 
velopment 15: 1506-1517, 2001), ara (KhlebnikovetaL, 
J Bacteriol. 2000 Dec;182(24):7029-34), ecdysone 
(Rheogene, www.rheogene.com), RU48 (steroid, Wang 
XJ, Liefer KM, Tsai S, O'Malley BW, Roop DR., Proc Natl 
AcadSciUS A. 1999 Jul 20;96(15):8483-8), fef promot- 
er (Rendal et a/., Hum Gene Ther. 2002 Jan;13(2): 
335-42. and Larnartina et a/., Hum Gene Ther. 2002 
Jan; 13(2):1 99-210), or a promoter disclosed in WO 
00/63364, filed April 19, 2000. 

Nuclear transcription vectors for the generation of a 
nuclear dsRNA expression library 

[0111] Nuclear transcription vectors for use in library 
based screening approaches to identify nucleic acids 
that modulate cell function, gene expression, or the bi- 
ological activity of a target polypeptide are designed 
such that the target sequence is flanked on one end by 
an RNA polymerase II promoter (for example, the HC- 
MV-IE promoter) and on the other end by a different 
RNA polymerase II promoter (for example, the SCMV 
promoter). Other promoters that can be used include 
other RNA polymerase II promoters, an RNA polymer- 
ase I promoter, an RNA polymerase III promoter, a mi- 
tochondrial RNA polymerase promoter, or a T7 or SP6 
promoter in the presence of T7 or SP6 RNA polymerase, 


respectively, containing a nuclear localization signal. 
Bacteriophage or viral promoters may also be used. The 
promoters are regulated transcriptionally (for example, 
using a tet ON/OFF system (Forster et al., supra; Liu et 

5 al., supra; and Gatz, supra) such that they are only ac- 
tive in either the presence of a transcription-inducing 
agent or upon the removal of a repressor A single chro- 
mosomal integrant is selected for, and transcription is 
induced in the cell to produce the nuclear dsRNA. 

10 [0112] Those vectors containing a promoter recog- 
nized by RNA Pol I, RNA Pol II, or a viral promoter in 
conjunction with co-expressed proteins that recognize 
the viral promoter, may also contain optional sequences 
located between each promoter and the inserted cDNA. 

is These sequences are transcribed and are designed to 
prevent the possible translation of a transcribed cDNA. 
For example, the transcribed RNA is synthesized to con- 
tain a stable stem-loop structure at the 5" end to impede 
ribosome scanning. Alternatively, the exact sequence is 

20 irrelevant as long as the length of the sequence is suf- 
ficient to be detrimental to translation initiation (e.g., the 
sequence is 200 nucleotides or longer). The RNA se- 
quences can optionally have sequences that allow 
polyA addition, intronic sequences, an HIV REV binding 

25 sequence, Mason-Pfizer monkey virus constitutive 
transport element(CTE) (U.S. 5,880,276, filed April 25, 
1996), and/or self splicing intronic sequences. 
[0113] To generate dsRNA, two promoters can be 
placed on either side of the target sequence, such that 

30 the direction of transcription from each promoter is op- 
posing each other. Alternatively, two plasmids can be 
cotransfected. One of the plasmids is designed to tran- 
scribe one strand of the target sequence while the other 
is designed to transcribe the other strand. Single pro- 

35 moter constructs may be developed such that two units 
of the target sequence are transcribed in tandem, such 
that the second unit is in the reverse orientation with re- 
spect to the other. Alternate strategies include the use 
of filler sequences between the tandem target sequenc- 

40 es. 

Cytoplasmic transcription vectors for the generation of 
a cytoplasmic dsRNA expression library 

45 [01 1 4] Cytoplasmic transcription vectors for use in li- 
brary based screening approaches to identifying nucleic 
acids that modulate cell function, gene expression, or 
the biological activity of a target polypeptide in a ceil us- 
ing PTGS are made according to the following method. 

so This approach involves the transcription of a single 
stranded RNA template (derived from a library) in the 
nucleus, which is then transported into the cytoplasm 
where it serves as a template for the transcription of 
dsRNA molecules. The DNA encoding the ssRNA is in- 

55 tegrated at a single site in the target cell line as de- 
scribed for the nuclear RNA expression library, thereby 
ensuring the synthesis of only one species of dsRNA in 
a cell, each cell expressing a different dsRNA species. 
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[0115] A desirable approach is to use endogenous 
polymerases such as the mitochondrial polymerase in 
animal cells or mitochondrial and chloroplast polymer- 
ases in plant cells for cytoplasmic and mitochondrial (e. 
g., chloroplast) expression to make dsRNA in the cyto- 5 
plasm. These vectors are formed by designing expres- 
sion constructs that contain mitochondrial or chloroplast 
promoters upstream of the target . sequence. As de- 
scribed above for nuclear transcription vectors, dsRNA 
can be generated using two such promoters placed on io 
either side of the target sequence, such that the direc- 
tion of transcription from each promoter is opposing 
each other. Alternatively, two plasm ids can be cotrans- 
fected. One of the plasmids is designed to transcribe 
one strand of the target sequence while the other is de- ts 
signed to transcribe the other strand. Single promoter 
constructs may be developed such that two units of the 
target sequence are transcribed in tandem, such that 
the second unit is in the reverse orientation with respect 
to the other. Alternate strategies include the use of filler 20 
sequences between the tandem target sequences. 
[0116] Alternatively, cytoplasmic expression of dsR- 
NA for use in library based screening approaches is 
achieved by a single subgenomic promoter opposite in 
orientation with respect to the nuclear promoter. The nu- 25 
clear promoter generates one RNA strand that is trans- 
ported into the cytoplasm, and the singular subgenomic 
promoter at the 3' end of the transcript is sufficient to 
generate its antisense copy by an RNA dependent RNA 
polymerase to result in a cytoplasmic dsRNA species, so 

Target cell line development for use with cytoplasmic 
dsRNA expression libraries 

[01 1 7] The target cell line, using the vector containing 35 
the G418 cassette, GFP, and loxP site is designed as 
described above. 

Development of a cytoplasmic dsRNA expression 

library *o 

[01 1 8] Double stranded RNA expression libraries are 
generated by inserting cDNA or randomized sequences 
(as described herein) into an expression vector contain- 
ing a single nuclear promoter (RNA polymerase I, RNA <5 
polymerase II, or RNA polymerase III), which allows 
transcription of the insert sequence. It is desirable that 
this nuclear promoter activity is regulated transcription- 
ally (for example, using a tet ON/OFF system described, 
for example, by Forster et al., supra; Liu et a/., supra; so 
and Gatz, supra), such that the promoters are only ac- 
tive in either the presence of a transcription-inducing 
agent or upon the removal of a repressor. This ensures 
that transcription is not induced until episomal copies of 
the vector(s) are diluted out. Vectors also contain a se- 55 
lectable marker, such as the hygromycin resistance 
gene, and a loxP site. The expression vectors are inte- 
grated into the target cell line by methods previously de- 


scribed in this application using Ore recombinase (other 
site-specific recombinative strategies can be employed, 
as described previously). 

[0119] At two days post-transfection, cells are sub- 
jected to hygromycin selection using concentrations es- 
tablished in kill curve assays. Surviving cells are cul- 
tured in hygromycin to select for cells bearing integrated 
vectors and to dilute out episomal copies of the vector 
(s). At this point transcription is induced, and a single 
stranded RNA (ssRNA) species derived from the insert 
sequence is transcribed in the nucleus from the nuclear 
promoter in the inserted vector. The insert is designed 
such that the insert sequences in the transcript are 
flanked by bidirectional promoters of RNA bacteri- 
ophages (for example, Qbeta or MS2, RNA dependent 
RNA polymerase promoters) or cytoplasmic viral RNA- 
dependent RNA polymerase promoter sequences (for 
example, those of Sindbis or VEEV subgenomic pro- 
moters). The nuclear transcript is translocated to the cy- 
toplasm where it acts as a template for dsRNA by an 
RNA dependent RNA polymerase, which may be pro- 
vided through co-transfection of a vector that encodes 
an RNA-dependent RNA polymerase. Alternatively, an 
integrated copy of the polymerase may be used. 

Example 2: Generation of templates for in vitro 
transcription of dsRNA for non-library based 
approaches for identification of nucleic acids using 
PTGS 

[0120] Nucleic acid sequences that modulate cell 
function, gene expression in a cell, or the biological ac- 
tivity of a target polypeptide in a cell may also be iden- 
tified using non-library based approaches involving 
PTGS. A single known nucleic acid sequence encoding 
a polypeptide with unknown function or a single nucleic 
acid fragment of unknown sequence and/or function can 
be made into a double stranded RNA molecule. This 
dsRNA is then transfected into a desired cell type and 
assayed for modulations in cell function, gene expres- 
sion in the cell, or the biological activity of a target 
polypeptide in the cell, using methods described herein. 
A modulation in cell function, gene expression in the cell, 
or the biological activity of a target polypeptide in the 
cell identifies the nucleic acid of the dsRNA as a nucleic 
acid the modulates the specific cell function, gene ex- 
pression, or the biological activity of a target polypep- 
tide. This non-library based approach to nucleic acid 
identification is desirably used under conditions that in- 
hibit an interferon response or dsRNA stress response. 
Such conditions are described in detail below. 
[0121] Nucleic acid fragments generated, for exam- 
ple, by PCR or restriction endonuclease digestion, en- 
coding the respective target sequences were used as 
templates for in vitro transcription reactions. PCR frag- 
ments are superior to plasmid templates for the synthe- 
sis of discrete sized RNA molecules. The PCR frag- 
ments encoded at least 20-50 or 1 00 to 1 000, for exam- 
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pie, 500 to 600 nucleotides (nts) of the target sequence 
and were derived from the target mRNA. Known target 
sequences were obtained from GenBank and or other 
DNA sequence databases. Target sequences were also 
obtained from cellular RNAs that were generated into 
cDNAs to create a number of different dsRNA mole- 
cules. Accordingly, it is possible that the sequence and/ 
or function of the target sequence was not known at the 
time the dsRNA was generated. 
[01 22] Templates for sense target RNAs were gener- 
ated by placing the bacteriophage T7 promoter at the 5' 
end of the target coding strand while antisense RNA 
-templates contained the-T7-promoter atthe 5' end of the 
non-coding strand. This was achieved by encoding the 
T7 promoter at the 5' ends of the respective PCR prim- 
ers. Alternatively SP6 promoters, or a combination of 
SP6 and T7 promoters may be used. 
[01 23] PCR was performed by conventional methods. 
The use of both PCR templates in equimolar amounts 
in an in vitro transcription reaction resulted in primarily 
dsRNA. The use of two separate fragments has been 
found to be superior to the use of one PCR fragment 
containing two T7 promoters, one located at each end 
of the target sequence, presumably due to transcription 
interference that occurs during transcription of the dual 
promoter template. Following PCR amplification, the 
DNA was subjected to Proteinase K digestion and phe- 
nol-chloroform extraction to remove contaminating 
RNases. Following ethanol precipitation, the DNA was 
resuspended in RNase-free water at a concentration of 
1 to 3 ng/uJ. 

[01 24] As an alternative to phenol-chloroform extrac- 
tion, DNA can be purified in the absence of phenol using 
standard methods such as those described by Li et al. 
(WO 00/44914, filed January 28, 2000). Alternatively, 
DNA that is extracted with phenol and/or chloroform can 
be purified to reduce or eliminate the amount of phenol 
and/or chloroform. For example, standard column chro- 
matography can be used to purify the DNA (WO 
00/44914, filed January 28, 2000). 

Example 3: In vitro RNA transcription and RNA analysis 

[0125] In vitro transcription reactions are carried out 
using the Riboprobe Kit (Promega Corp.), according to 
the manufacturer's directions. The template DNA is as 
described above. Following synthesis, the RNA is treat- 
ed with RQ1 DNase (Promega Corp.) to remove tem- 
plate DNA. The RNA is then treated with Proteinase K 
and extracted with phenol-chloroform to remove con- 
taminating RNases. The RNA is ethanol precipitated, 
washed with 70% ethanol, and resuspended in RNase- 
free water. Aliquots of RNA are removed for analysis 
and the RNA solution is flash frozen by incubating in an 
ethanol-dry ice bath. The RNA is stored at -80°C. 
[01 26] As an alternative to phenol-chloroform extrac- 
tion, RNA can be purified in the absence of phenol using 
standard methods such as those described by Li et al. 


(WO 00/44914, filed January 28, 2000). Alternatively, 
RNA that is extracted with phenol and/or chloroform can 
be purified to reduce or eliminate the amount of phenol 
and/or chloroform. For example, standard column chro- 

s matography can be used to purify the RNA (WO 
00/44914, filed January 28, 2000). 
[0127] dsRNA is made by combining equimolar 
amounts of PCR fragments encoding antisense RNA 
and sense RNA, as described above, in the transcription 

10 reaction. Single stranded antisense or sense RNA is 
made by using a single species of PCR fragment in the 
reaction. The RNA concentration is determined by spec- 
trophotometry analysis, and -RNA quality is assessed 
by denaturing gel electrophoresis and by digestion with 

15 RNase T1 , which degrades single stranded RNA. 
[0128] An mRNA library is produced using Qbeta bac- 
teriophage, by Hgating the mRNAs to the flank sequenc- 
es that are required for Qbeta replicase function (Qbeta 
flank or Qbeta flank plus P1), using RNA ligase. The 

20 ligated RNAs are then transformed into bacteria that ex- 
press Qbeta replicase and the coat protein. Single 
plaques are then inoculated into fresh bacteria. All 
plaques are expected to carry transgene sequences. 
Each plaque is grown in larger quantities in bacteria that 

25 produce the Qbeta polymerase, and RNA is isolated 
from the bacteriophage particles. Alternatively, if the 
Qbeta flank plus P1 is used to generate the library (e. 
g., P1=MS2, VEEV, or Sindbis promoter sequences), 
these vectors can be used to carry out the in vitro tran- 

30 scription along with the cognate polymerase. The in vitro 
made dsRNA is then used to transfect cells. 

RNA delivery 

35 [0129] In vitro made dsRNA is directly added to the 
cell culture medium at concentrations ranging from 50 
u.g/ml to 500 u.g/ml. Uptake of dsRNA is also facilitated 
by electroporation using those conditions required for 
DNA uptake by the desired cell type. RNA uptake is also 

40 mediated by lipofection using any of a variey of com- 
mercially available and proprietary cationic lipids, DE- 
AE-dextran-mediated transfection, microinjection, pro- 
toplast fusion, calcium phosphate precipitation, viral or 
retroviral delivery, or biolistic transformation. The RNA 

45 is naked RNA or a local anesthetic RNA complex. Mod- 
ulation of cell function, gene expression, or polypeptide 
biological activity is then assessed in the transfected 
cells. 

[0130] Some dsRNA sequences, possibly in certain 
so cell types and through certain delivery methods, may re- 
sult in an interferon response. During the screening 
methods of the present invention, induction of an inter- 
feron response is not desired, as this would lead to cell 
death and possibly to the prevention of gene silencing. 
55 [0131] One of the components of an interferon re- 
sponse is the induction of the interferon-induced protein 
kinase PKR. Suppression of the interferon response 
and/or the PKR response is desired in the target cells. 
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The dsRNA delivery methods described herein are per- 
formed such that an interferon response or dsRNA 
stress response is not included. It is recognized, how- 
ever, that certain conditions might present with an in- 
duction of the interferon response. To prevent such a 
response, a number of other strategies may be em- 
ployed with any of the above described screening meth- 
ods to identify a nucleic acid that modulates cell func- 
tion, gene expression, or the polypeptide biological ac- 
tivity of a cell, as described herein. 
[0132] To prevent an interferon response, interferon 
and PKR responses are silenced in the target cells using 
a dsRNA species directed against the mRNAs that en- 
code proteins involved in the response. Desirably inter- 
feron response promoters are silenced using dsRNA. 
Alternatively, the expression of proteins that bind the in- 
terferon response element is abolished using dsRNA 
techniques. 

[0133] In an alternative strategy, interferon and PKR 
knockout cell lines are created through approaches uti- 
lizing expression cassettes that encode an antisense 
RNA and ribozymes directed to the cellular mRNAs that 
encode the proteins involved in the response. Knockout 
cells are created by standard gene knockout technolo- 
gies using homologous recombination to alter target se- 
quences, using homologous DNA alone, or as complex- 
es of RecA protein and single stranded DNA homolo- 
gous to the target sequence(s). Interferon response el- 
ement (IRE) sequences, sequences that encode tran- 
scription factors that bind IRE sequences, the promoter 
and/or gene sequences that encode proteins in the PKR 
and interferon response pathways are molecules that 
are targeted for knockout. 

[0134] If desired, proteins involved in gene silencing 
such as Dicer or Argonaut can be overexpressed or ac- 
tivated to increase the amount of inhibition of gene ex- 
pression (Beach et a/., WO 01/68836, filed March 16, 
2001). 

Example 4: Cytoplasmic transcription vectors for non- 
library based approaches to nucleic acid identification 
using PTGS 

[0135] Double stranded RNA molecules for use in 
non-library based methods for the identification of nu- 
cleic acids that modulate cell function, gene expression 
of a target nuclec acid, or target polypeptide biological 
activity in a cell can also be generated through the use 
of cytoplasmic transcription vectors. Such vectors are 
generated as now described. 

[01 36] The PCR fragments generated for in vitro tran- 
scription templates, as described above, are inserted in- 
to a cloning vector containing one T7 promoter located 
just outside the polylinker region. Such a vector is pZE- 
RO blunt (Promega Corp.). The PCR fragment is cloned 
into a restriction site in the polylinker in such a way that 
the fragment's T7 promoter is distal to the vector's pro- 
moter. The resulting vector contains the target se- 


quence flanked by two T7 promoters; transcription from 
this vector occurs in converging directions. Convergent 
transcription is desired for these intracellular vectors, 
due to the uncertainty of getting sense and antisense 

s vectors into the same cell in high enough and roughly 
equivalent amounts. In addition, the local concentration 
of antisense and sense RNAs with respect to each other 
is high enough to enable dsRNA formation when the du- 
al promoter construct is used. 

10 [0137] A hygromycin resistance cassette is cloned in- 
to the pZERO blunt vector as well. The hygromycin re- 
sistance cassette contains the hygromycin resistance 
gene under the control of the Herpes Simplex Virus 
(HSV) thymidine kinase promoter and the SV40 polya- 

15 denlyation signal. The cassette is in a plasmid vector 
and is flanked at both ends by a polylinker region ena- 
bling ease of removal and subsequent cloning. Hygro- 
mycin selection was chosen because of the rapidity of 
death induced by hygromycin as well as extensive in- 

20 house experience with hygromycin selection. Alterna- 
tively, other selection methods known to those skilled in 
the art may be used. 

[0138] The vectors are transfected into the desired 
cells using standard transformation or transfection tech- 
25 niques described herein, and the cells are assayed for 
the ability of the dsRNA molecules encoded by the vec- 
tors to modulate cell function, gene expression of a tar- 
get nuclec acid, or the biological activity of a target 
polypeptide, as described herein. 

30 

Example 5: Analysis of RNA from transfected cells 

[01 39] Regardless of whether a library based screen- 
ing approach or a non-library based approach was used 

35 to identify nucleic acid sequences, in order to measure 
the level of dsRNA effector molecule within the cell, as 
well as the amount of target mRNA within the cell, a two- 
step reverse transcription PCR reaction was performed 
with the ABI PRISM™ 7700 Sequence Detection Sys- 

40 tern. Total RNA was extracted from cells transfected with 
dsRNA or a plasmid from a dsRNA expression library 
using Trizol and DNase. Two to three different cDNA 
synthesis reactions were performed per sample; one for 
human GAPDH (a housekeeping gene that should be 

45 unaffected by the effector dsRNA), one for the target 
mRNA, and/or one for the sense strand of the expected 
dsRNA molecule (effector molecule). Prior to cDNA syn- 
thesis of dsRNA sense strands, the RNA sample was 
treated with T1 RNase. The cDNA reactions were per- 

50 formed in separate tubes using 200 ng of total RNA and 
primers specific for the relevant RNAs. The cDNA prod- 
ucts of these reactions were used as templates for sub- 
sequent PCR reactions to amplify GAPDH, the target 
cDNA, and/or the sense strand copied from the dsRNA. 

55 All RNA was quantified relative to the internal control, 
GAPDH. 
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Example 6: Target sequence identification 

[0140] To identify the target sequence affected by a 
dsRNA, using any of the above-described methods, 
DNA is extracted from expanded cell lines (or from the 
transfected cells if using a non-integrating dsRNA sys- 
tem) according to methods well known to the skilled ar- 
tisan. The dsRNA encoding sequence of each integrant 
(or non-integrated dsRNA molecule if using a non-li- 
brary based method) is amplified by PCR using primers 
containing the sequence mapping to the top strand of 
the T7 promoter (or any other promoter used to express 
-the dsRNA). Amplified DNA is then cloned into a cloning 
vector, such as pZERO blunt (Promega Corp.), and then 
sequenced. Sequences are compared to sequences in 
GenBank and/or other DNA databases to look for se- 
quence identity or homology using standard computer 
programs. If the target mRNA remains unknown, the 
mRNA is cloned from the target cell line using primers 
derived from the cloned dsRNA by established tech- 
niques (Sambrook et al., supra). Target validation is then 
carried out as described herein. 
[0141] In the stably integrated dsRNA expression sys- 
tem described above, despite efforts to reduce negative 
position effects, inefficient dsRNA synthesis by PCR 
methods may occur. This can be circumvented by res- 
cuing the integrated cDNA or randomized nucleic se- 
quences into replicating plasmids. Rescued plasmids 
are amenable to amplification in bacteria and to se- 
quencing. Rescue is achieved by re-transfecting the 
population of cells transfected with the dsRNA expres- 
sion library with the rescue plasmid and a plasmid en- 
coding Cre recombinase. The rescue plasmid carries a 
bacterial origin of replication, a bacterial antibiotic se- 
lection marker, an SV40 origin of replication, and an 
SV40 T antigen expression cassette, as well as loxP 
sites positioned as an inverted repeat to allow Cre-me- 
diated double recombination. The SV40-based origin of 
replication in the rescue plasmid allows amplification of 
rescued sequences in the integrated cells. Following 
rescue, higher levels of transcription are anticipated, 
thereby favoring dsRNA formation. The cells are then 
screened for modulations in cell function, target nucleic 
acid expression, or target polypeptide biological activity 
changes as described herein. 

Example 7: Prevention of an interferon response during 
gene silencing 

[0142] As discussed above, during the above-de- 
scribed screening methods, induction of an interferon 
response is not desired, as this would lead to cell death, 
anti-proliferative responses, and possibly to prevention 
of gene silencing. One of the components of an interfer- 
on response is the induction of the interferon-induced 
protein kinase PKR. Suppression of the interferon re- 
sponse and/or the PKR response is desired in the target 
cells. The dsRNA delivery methods described herein 


are performed such that an interferon response is not 
included. It is recognized, however, that certain condi- 
tions might present with an induction of the interferon 
response. To prevent such a response, a number of oth- 
s er strategies may be employed with any of the above 
described screening methods to identify a nucleic acid 
that modulates cell function, gene expression or the 
polypeptide, biological activity of a cell, as described 
herein. 

10 [0143] To prevent an interferon response in a system 
involving stable integration of the nucleic acid containing 
the dsRNA expression cassette, the vectors used to 
generate either the loxP integrant or the vector that en- 
codes the dsRNA expression cassette are designed to 

is contain sequences that encode proteins that block the 
PKR response, such as the Vaccinia virus protein E3 
(Romano et a/., Molecular and Cellular Biology 18: 
7304-731 6, 1 998; Accession No. M36339>, or a cellular 
protein P58 ,PK , which the influenza virus mobilizes to 

20 block PKR (Gale et al., Microbiology and Molecular Bi- 
ology Reviews 64:239-280, 2000; Accession No. 
XM_032882). Several other viral proteins have also 
been identified (e.g., Hepatitis C E2; Accession No. 
S72725) and may be similarly used. These proteins can 

25 be expressed in the desired cell types or in animals 
through the use of any of a number of commercially 
available mammalian expression vectors or vertebrate 
expression vectors. Such vectors can be obtained from 
a number of different manufacturers including Invitrogen 

30 (Carlsbad, CA) Promega ((Madison, Wl), or Clontech 
(Palo Alto, CA). An example of such a vector is the pCI- 
neo Mammalian Expression Vector from Promega. 
[0144] Regardless of whether nucleic acid encoding 
a dsRNA is stably integrated into a chromosome or is 

35 not integrated into a chromosome, the following meth- 
ods may be used to prevent an interference response 
in any of the screening methods of the present invention. 
In one example of an interferon avoidance strategy, in- 
terferon and PKR responses are silenced in the target 

40 cells using a dsRNA species directed against the mR- 
NAs that encode proteins involved in the response. De- 
sirably interferon response promoters are silenced us- 
ing dsRNA. Alternatively, the expression of proteins that 
bind the interferon response element is abolished using 

45 dsRNA techniques. 

[0145] In an alternative strategy, interferon and PKR 
knockout cell lines are created through approaches uti- 
lizing expression cassettes that encode an antisense 
RNA and ribozymes directed to the cellular mRNAs that 

so encode the proteins involved in the response. Knockout 
cells are created by standard gene knockout technolo- 
gies using homologous recombination to alter target se- 
quences, using homologous DNA alone, or as complex- 
es of RecA protein and single stranded DNA homolo- 

55 gous to the target sequence(s). Interferon response el- 
ement (IRE) sequences, sequences that encode tran- 
scription factors that bind IRE sequences, the promoter 
and/or gene sequences that encode proteins in the PKR 
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and interferon response pathways are molecules that 
are targeted for knockout. 

[01 46] In yet another alternative, chimeric oligonucle-. 
otides may be used to alter target sequences. Methods 
for inhibiting expression of.polypeptides through chimer- 
ic oligonucleotides are well known in the art (Igoucheva 
and Yoon, Human Gene Therapy 11 :2307-2312, 2000). 

Example 8: Functional screening for cell invasion 

[0147] Cell invasion is one cell function that may be 
evaluated in the search for novel genes that are modu- 
lated using the methods described herein. Matrigel, a 
biological extracellular matrix, has properties similar to 
that of a reconstituted basement membrane and has 
been used to measure the invasive potential of tumor 
cells (Platet and Garcia, supra). Cells transfected with 
randomized or cDNA libraries that have been cloned in- 
to PTGS vectors are monitored for their capacity to in- 
vade matrigel invasion chambers. Cells that have taken 
up sequences unrelated to invasion invade the matrigel 
as efficiently as vector-transfected control cells. Cells 
experiencing PTGS of genes that are involved in cell in- 
vasion invade much less efficiently. If the dsRNA ex- 
pression cassette is stably integrated in a chromosome, 
these cells are retrieved and second and third rounds of 
selection are carried out in order to isolate specific nu- 
cleic acid sequences relevant to cell invasion. The effect 
of these sequences on invasion is ultimately confirmed 
by their ability to block the formation of tumors in animal 
models. 

[0148] Several human cell lines, for example, 
MDA-MB-231 , used by Platet and Garcia (supra), 
SKBr3, and MCF-7ADR, a more metastatic variant of 
MCF-7. MDA-MB-231 breast cancer cells (obtained 
from the American Type Culture Collection) are also 
transfected with cDNA -libraries or randomized nucleic 
acid libraries constructed into the vectors described 
above. Desirably ail cells in this assay contain a single 
copy of a transfected gene, as described above. 
[01 49] Cells cultured in commercially available 24- or 
96-well formatted systems are used to carry out the cell 
invasion assay. As this screening protocol relies upon 
repeated rounds of selection, it may be desirable to keep 
the cell numbers in each well low enough that enrich- 
ment is seen in each succeeding round, yet high enough 
to recover sufficient cells to culture within a reasonable 
time period. Therefore, culture conditions that result in 
invasion by greater than 50% of the cells and that still 
permit recovery from the surface of the matrigel are 
made optimal. Non-invasive (NIH3T3 cells) or poorly in- 
vasive (MCF7) cell lines are analyzed in parallel as neg- 
ative controls for invasion. 

[0150] Initially, triplicate cultures of half-log order di- 
lutions from 10 2 to 10 6 cells per well are plated. Cells 
are then recovered by "scrubbing" with a sterile cotton 
swab in fresh culture media and are seeded into 96-well 
plates. The number of invasive cells in the matrigel is 


quantified using either an MTT-based assay (Sasaki 
and Passaniti, Biotechniques 24:1038-1043, 1998) or a 
fluorescent indicator (Gohla era/., Clin. Exp. Metastasis 
14:451-458,1996), 

s [0151] Once the appropriate cell densities for the as- 
say have been empirically determined, stable transfect- 
ed cells are plated in the matrigel cell invasion cham- 
bers. Each experiment includes the following controls: 
a sample of untransfected cells as a reference culture; 

10 untransfected cells treated with anti-invasive chemo- 
therapeutic agents, such as taxol or doxorubicin, as a 
positive control for inhibition of invasion; cells transfect- 
ed with empty vectors to confirm that the vector alone 
had no effects on invasion; and cells transfected cells 

is with genes that are known to block invasion in this as- 
say, such as estrogen receptor-a or TIMP-2 (Kohn et 
a/. t Cancer Research 55:1856-1862, 1995; and Wood- 
house et a/., Cancer (Supplement) 80:1529-1536, 
1997). 

20 [0152] Cells that fail to invade the matrigel are re- 
moved from each well to the corresponding wells of a 
96-well plate and cultured until macroscopic colonies 
are visible. It is important to collect cells at more than 
one time point after plating, since the time it takes for 

25 PTGS to be effective may vary, and it may be that dif- 
ferent genes are active at different times after plating. 
Once the cells are transferred to 96-well plates, they are 
diluted out and taken through successive rounds of re- 
screening in the invasion assay in order to expand and 

30 isolate cell lines with altered invasive ability. As the pop- 
ulation becomes more and more enriched for cells with 
a non-invasive phenotype, the reduction in invasive 
cells in the matrigel can be better quantified via MTT or 
fluorescence assays. Ultimately, a large panel of cloned 

35 double-stable cell lines is generated. 

[0153] This assay can also be carried out with cells 
into which a dsRNA is not stably integrated into a chro- 
mosome. The assay is conducted essentially as de- 
scribed above except that multiple rounds of selection 

40 and re-screening are not necessary since the cell is 
transfected with only one dsRNA species. Thus, the tar- 
gets) of the PTGS event is readily identifiable using the 
cloning and sequencing techniques described above. 

45 Example 9: Downregulation of HIV using HIV-derived 
dsRNA and inhibitors of the interferon response 
pathway 

[01 54] During the course of HIV infection, the viral ge- 
so nome is reverse transcribed into a DNA template that is 
integrated into the host chromosome of infected dividing 
cells. The integrated copy is now a blueprint from which 
more HIV particles are made. Several cell lines that con- 
tain integrated copies of a defective HIV genome, 
55 HIVgpt (strain HXB2) have been created. The HIVgpt 
genome contains a deletion of the HIV envelope gene; 
all other HIV proteins are encoded. The plasmid used 
to create these cell lines, HIVgpt, was obtained from the 
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AIDS Research and Reference Reagent Program Cat- 
alog. Stably integrated cell lines were made with human 
rhabdomyosarcoma (RD) cells. The lines were made by 
transfecting cells with the plasmid followed by selection 
. of cells in mycophenolic acid. The HIVgpt genome en- 5 
codes a mycophenolic acid (MP A) resistance gene in 
place of the envelope gene and thereby confers resist- 
ance to MPA. Cells resistant to MPA were clonally am- 
plified. The media from the cultured clonally expanded 
cells was assayed for the presence of p24 (an HIV gag 10 
polypeptide that is secreted extracellularly). All cell lines 
were positive for p24, as assessed using a p24 ELISA 
assay kit (Coulter, Fullerton, CA). The cell lines also 
make non-infectious particles which can be rescued into 
infectious particles by co-expression of an H IV envelope is 
protein. 

[0155] The HIVgpt cell lines are used as a model sys- 
tem with which to downregulate HIV expression via 
PTGS. Plasmids encoding a 600 nt sense RNA, a 600 
nt antisense RNA, or a 600 bp double stranded RNA 20 
(dsRNA), mapping to the same coordinates of the gag 
gene of HIV strain HXB2 are used to transfect cells (the 
map from which the coordinates are based is found at 
GenBank Accession number K03455, HIV(HXB2), com- 
plete genome, and the gag RNAs used in this study map 25 
to coordinates 901-1500). Expression of the RNAs is 
from T7 RNA polymerase promoter(s) located at the 5' 
end of the gag sense strand, at the 5' end of the anti- 
_sense_ strand,, or at conyerging positions at the„5^ends 
of both the sense and anti-sense strands, respectively. 30 
These encoded RNAs are not designed to be able to 
make protein (i.e., they do not have a cap, a poly A tail, 
or the native initiation codon). Transcription of the RNAs 
is catalyzed by T7 RNA polymerase, provided from a 
second co-transfected T7 RNA polymerase expression 35 
plasmid. Control plasmids expressing a similar sized 
sense RNA, antisense RNA, and dsRNA derived from 
the gD gene of an HSV2 genome are included as ex- 
perimental controls (the map from which the coordinates 
are based is found at GenBank Accession number <o 
K01408, HSVgD2 gene, and the HSVgD RNAs used in 
this study map to coordinates 313-872). 
[01 56] Cells used in these studies are transfected with 
an expression plasmid encoding a gene product known 
to interfere with the dsRNA induced interferon response 45 
or with the PKR response, as described above. The cells 
are transfected with lipofectamine (Gibco-BRL) as a 
transfecting agent according to the manufacturer's in- 
structions. 

[0157] Two days after transfection, the cells are har- so 
vested and seeded into six-well plates and cultured to 
approximately 80 to 90% confluence. Cells are co-trans- 
fected with the T7 RNA polymerase expression plasmid 
and one of the RNA expression plasmids, such that one 
well of cells receives the T7 RNA polymerase expres- 55 
sion plasmid and the gag sense RNA expression plas- 
mid; one well of cells receives the T7 RNA polymerase 
expression plasmid and the gag antisense RNA expres- 


sion plasmid; one well of cells receives the T7 RNA 
polymerase expression plasmid and the gag dsRNA ex- 
pression plasmid; one well of cells receives the T7 RNA 
polymerase expression plasmid and the HSVgd sense 
RNA expression plasmid; one well of cells receives the 
T7 RNA polymerase expression plasmid and the HSVgd 
antisense RNA expression plasmid; and one well of 
cells receives the T7 RNA polymerase expression plas- 
mid and the HSVgD dsRNA expression plasmid. Trans- 
fection is again mediated through lipofectamine (Gib- 
co-BRL). There also is a control group of cells receiving 
no RNA. The cells are monitored for p24 synthesis over 
the course of several weeks. The cells are assayed both 
by measuring p24 in the media of cells (using the p24 
ELISA kit from Coulter, according to the manufacturer's 
instructions) and by immunostaining fixed cells for p24 ' 
using a rabbit polyclonal anti-p24 sera and anti-rabbit 
IgG that is FITC conjugated (Sigma). None of the gD 
RNAs specifically shut down p24 synthesis. The double 
stranded gag RNA significantly down regulates p24. 
The sense and antisense have only a modest effect on 
p24 synthesis and some of the effect is predicted to be 
through the ability of the sense and antisense gag RNAs 
to generate low levels of dsRNA species. 

Example 10: Downregulation of PSA expression in 
human Rhabdomyosarcoma cells using intracellular 
expression of dsRNA 

[0158] RD cells transiently expressing prostate spe- 
cific antigen (PSA) were transfected with a T7 RNA 
polymerase expression vector and T7 RNA expression 
vectors expressing PSA dsRNA, PSA sense RNA, PSA 
antisense RNA, or control RNAs. The ability of the ex- 
pressed RNAs to downregulate PSA expression was 
assessed, as described further below. 

Creation of a transient PSA expression line 

[01 59] The ability to downregulate expression of PSA 
following the expression of a PSA specific double- 
stranded RNA (dsRNA) was demonstrated in a human 
rhabdomyosarcoma cell line. Since available PSA cell 
lines are difficult to work with (i.e., they are hard to trans- 
feet, and the cells tend to clump), a human cell line tran- 
siently expressing PSA was created. To create these 
cells, human rhabdomyosarcoma cells were transiently 
transfected with a PSA plasmid-based expression vec- 
tor, under conditions that result in transfection of greater 
than 95% of the cells. Transfection was mediated with 
lipofectamine transfecting reagent (Gibco-BRL) accord- 
ing to the manufacturer's instructions. Expression of 
PSA was directed by the HCMV-IE promoter and the 
SV40 polyadenylation signal (Fig. 2). PSA expression 
was measured in the supernatant of transfected cells 
using a PSA ELISA kit (Oncogene Science Diagnostics, 
Cambridge, MA). No PSA was detected in the untrans- 
fected parental cells while PSA was abundantly ex- 
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pressed in cells receiving the PSA expression vector. 

Downregulation of PSA expression 

[0160] The PSA expressing cell line was used as a 
model system with which to demonstrate the downreg- 
ulation of PSA protein levels by PTGS. In these studies, 
piasmids encoding an approximately 600 nt sense RNA, 
a 600 nt antisense RNA or a 600 nt dsRNA derived from 
a PSA cDNA were used to transfect the PSA expressing io 
cell line (Fig. 2). Expression of the RNAs was from a T7 
RNA polymerase promoter(s) located at the 5' end of 
the PSA sense strand, at the 5' end of the PSA antisense 
strand, or at converging positions at the 5' ends of both 
the sense and antisense strands respectively (Fig. 2). *5 
These encoded RNAs are not designed to be able to 
make protein (they do not have a cap, or a poly A tail). 
Transcription of the RNAs was catalyzed by T7 RNA 
polymerase, provided by a co-transfected T7 RNA ex- 
pression plasmid. Control piasmids expressing similar 20 
sized sense RNA, antisense RNA, and dsRNA derived 
from the glycoprotein D (gD) gene of an Herpes simplex 
2 (HSV-2) genome, as described above were included 
as experimental controls. 

[0161] Cells used in these studies can optionally be 25 
transfected with an expression plasmid encoding a gene 
product known to interfere with the dsRNA induced in- 
terferon response or with the PKR response, as de- 
scribed above. The cells are transfected with lipo- 
fectamine (Gibco-BRL) as a transfecting agent accord- 30 
ing to the manufacturer's instructions. 
[0162] Human rhabdomyosarcoma cells were seed- 
ed into six-well plates and cultured to approximately 80 
to 90% confluence. The cells were co-transfected with 
(A) the PSA expression plasmid; (B) one of the T7 RNA 35 
expression piasmids; and (C) the T7 RNA polymerase 
expression plasmid, such that all PSA expressing cells 
were transfected with the T7 RNA polymerase expres- 
sion plasmid and one of the following: the T7 sense PSA 
RNA expression construct, the T7 antisense PSA RNA *o 
expression construct, the T7 dsRNA PSA expression 
construct, the sense HSVgD RNA expression construct, 
the antisense HSVgD expression construct, or the dsR- 
NA HSVgd expression construct. Cells received identi- 
cal amounts of the PSA expression plasmid and the T7 45 
RNA expression plasmid. The amounts of the T7 RNA 
expression piasmids were also constant amongst the 
transfected cells. Total DNA per transfection was held 
constant at 2.5 ug DNA per one well of a six-well plate. 
In those transfections where there was no T7 RNA ex- so 
pression plasmid, an inert plasmid DNA was used as 
filler DNA. Transfection was mediated by lipofectamine 
(Gibco-BRL) according to the manufacturer's instruc- 
tions. There was also a control group of untransfected 
cells, as well as an untreated PSA control group of cells 55 
transfected with only the PSA expression plasmid in 
combination with the T7 RNA polymerase expression 
plasmid. 


[0163] PSA-expressing cells that were not transfect- 
ed with the T7 RNA expression plasmid, as well as cells 
transfected with the T7 HSV2-gDRNA expression plas- 
mid all expressed PSA abundantly and at comparable 
levels. Cells transfected with the sense, antisense, and 
ds PSA RNA expression piasmids all exhibited varying 
degrees of inhibition of PSA expression. A 5%-1 0% re- 
duction in expression was seen in cells expressing the 
PSA sense RNA, a 50% reduction was seen in cells ex- 
pressing the PSA antisense RNA and greater than 95% 
reduction was seen in the cells expressing the PSA dsR- 
NA (Fig. 3). The inhibition was seen within two days after 
transfection and continued up until the last time point 
taken (one month later) at which point PSA levels were 
beginning to decline in the untreated ceils and the ex- 
periment was terminated. The untreated PSA controls 
as well as cells transfected with the T7 HSV2-gD control 
RNA expression piasmids all expressed PSA abundant- 
ly and at comparable levels (Fig. 3), indicating the spe- 
cificity of dsRNA effectors to silence gene expression. 
During the one of month culture, cells were expanded 
into larger cultures at routine intervals. 
[0164] Although the PSA specific dsRNA induced sig- 
nificant inhibition of PSA expression, antisense and 
sense PSA RNAs also induced some level of inhibition. 
Antisense PSA RNA has the potential to form dsRNA 
by annealing with PSA mRNA. Therefore the inhibition 
seen with antisense RNA may be explained by both an 
antisense mechanism and a dsRNA induced inhibition. 
A critical intracellular concentration of both antisense 
RNA and mRNA is required to generate dsRNA. Since 
much less dsRNA is made in the antisense RNA ex- 
pressing cells relative to those cells designed to make 
dsRNA, a lesser inhibition of PSA in the antisense RNA 
expressing cells is expected if the threshold dsRNA lev- 
els required for efficient silencing have not been 
reached in those ceils. We have also demonstrated that 
a small amount of antisense RNA can be found in cells 
transfected with our expression vectors (approximately 
0.2% the amount of mRNA steady state levels). Anti- 
sense expression is presumably driven by cryptic pro- 
moters on the non-coding plasmid DNA strand. The ob- 
served sense RNA inhibition could therefore also in- 
volve a dsRNA molecule. RNA from transfected but un- 
treated cells could also be analyzed to determine if the 
low level expression of antisense RNAs in these cells 
results in the production of detectable dsRNA species. 
Some low level expression of PSA occurred in cells ex- 
pressing PSA dsRNA. It is likely that some percentage 
of ceils did not take up the dsRNA expression cassette 
or that the threshold levels of dsRNA were not reached 
in some ceils. No cellular toxicity was seen with any of 
the dsRNAs generated by the RNA expression vectors 
suggesting that cytoplasmic expression of dsRNA does 
not induce the interferon response. In contrast, cell 
death is induced when certain concentrations of in vitro 
produced dsRNA is delivered to cells via transfection 
with certain cationic lipids. 
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[01 65] In summary, these results indicate that (i) PSA 
derived dsRNA is much more efficient than PSA anti- 
sense RNA in down-regulating PSA expression, (ii) the 
down-regulation is sequence specific; only the PSA de- 
rived dsRNA and not the control HSV-2 derived dsRNA 
induced down-regulation of PSA, and (iii) there is no tox- 
icity associated with the cytoplasmic expression of long 
(600 bp) dsRNA molecules. Additionally, these experi- 
ments are the first demonstration of dsRNA mediated 
down-regulation of gene expression in a human cell line. 

Example 1 1 : Intracellular expression of dsRNA does not 
induce the type 1 interferon response (RNA stress 
response). 

[0166] Human rhabdomyosarcoma (RD) cells were 
transfected with various dsRNA expression vectors 
such that dsRNA was transcribed in the transfected cells 
as described in Example 1 0. Transcription of dsRNA oc- 
curred in the cells within 24 hours after transfection and 
continued for the duration of the thirty day experiment. 
Cells and the supernatants from transfected cells were 
analyzed during the course of the experiment for any 
evidence of RNA stress response induction. No evi- 
dence of RNA stress response induction by intracellular 
expressed dsRNA was observed. RD cells have been 
shown by us to be responsive to type 1 interferon, both 
alpha and beta, and thus RD cells are capable of mount- 
ing an RNA stress response. In addition, positive con- 
trols were included in these experiments. A positive con- 
trol for these experiments is a method of delivering dsR- 
NA which induces the RNA stress response. All positive 
controls induced the RNA stress response. These ex- 
periments are described further below. 

Assays performed to identify RNA stress response 
induction 

[01 67] The following assays were performed to meas- 
ure the induction of an RNA stress response: TUNEL 
assay to detect apoptotic cells, ELISA assays to detect 
the induction of alpha, beta and gamma interferon, ri- 
bosomal RNA fragmentation analysis to detect activa- 
tion of 2 , 5'OAS, measurement of phosphorylated elF2a 
as an indicator of PKR (protein kinase RNA inducible) 
activation, proliferation assays to detect changes in cel- 
lular proliferation, and microscopic analysis of cells to 
identify cellular cytopathic effects. Apoptosis, interferon 
induction, 2'5' OAS activation, PKR activation, antipro- 
liferative responses, and cytopathic effects are all indi- 
cators for the RNA stress response pathway. 

Transfection ofcelis 

[01 68] Approximately 7x1 0 s RD cells were seeded in- 
to individual wells of six-well plates. Cells were trans- 
fected when they reached about 90% confluency. Cells 
were transfected with a T7 RNA polymerase expression 


construct and a T7 dsRNA expression construct. The 
T7 dsRNA expression constructs encode converging T7 
promoters located on either side of a 600 bp sequence 
(Fig. 2). Controls included cells transfected with the T7 

5 RNA expression construct alone so that no dsRNA is 
made in these cells. Total DNA per transfection was held 
constant at 2.5 \ig DNA per one well of a six-well plate. 
When the T7 RNA polymerase and T7 dsRNA expres- 
sion vectors were both used, 1 .25 u.g of each DNA was 

10 used per transfection. In those transfections where 
there was no T7 dsRNA expression construct, inert filler 
DNA was used to bring the total DNA to 2.5 u.g per trans- 
fection. Transfection was mediated using Lipofectamine 
(InVitrogen) according to the manufacturer's directions. 

is The positive control transfections included poly(l)(C) 
RNA and in vitro transcribed dsRNA of 600 bp that were 
both complexed with Lipofectamine and transfected into 
cells. The cells were transfected with in vitro transcribed 
ssRNA complexed with Lipofectamine. 2.5 u,g of each 

20 RNA was used per transfection. Other controls included 
untreated cells. Cells were kept in culture for one month 
by expanding into larger flasks as the cell numbers in- 
creased. 

25 ELISA assays 

[0169] Supernatants were removed from the trans- 
fected and untreated cells at time points of 1, 2, 7, 17, 
and 48 hours and every several days for up to one month 

30 after the 48 hour time point. Collected supernatants 
were stored at -80°C until they were analyzed for the 
presence of alpha, beta, and gamma interferon using 
commercially available ELISA kits. The Interferon-alpha 
ELISA kit was obtained from ENDOGEN (Rockford, IL), 

35 the Interferon-Beta ELISA kit was obtained from RD1: 
(Flanders, NJ), and the Interferon-gamma ELISA kit was 
obtained from R&D Systems (Mirineapolois.MN)"" ELI- 
SAs were all performed according to the manufacturer's 
directions. Alpha, beta, and gamma interferon were not 

40 detected at increased levels in cells expressing intrac- 
ellular dsRNA compared to the corresponding levels in 
untreated cells. However, considerable levels of beta in- 
terferon were found in cells transfected with poly (l)(C) 
or with in vitro transcribed dsRNA and ssRNA. Alpha 

45 and beta interferon induction are associated with induc- 
tion of the RNA stress response. 

TUNEL assay 

so [0170] Cells were stained for the presence of apop- 
totic nuclei using a commercially available kit, TdT 
FragEL, DNA Fragmentation Detection Kit, In Situ Ap- 
optosis Assay from Oncogene (Boston, MA). Cells were 
stained according the manufacturer's directions. Cells 

55 were stained at 2 hours, 7 hours, 17 hours, 2 days, 3 
days, 4 days, and 5 days after transfection. There was 
no evidence of apoptosis induced by intracellular ex- 
pressed dsRNA at any of the time points analyzed. How- 
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ever, the majority of cells transfected with poly (l)(C) or 
with the in vitro transcribed dsRNA were apoptotic by 
17 hours after transfection. No evidence of apoptosis 
was observed in the untreated cells or in cells transfect- 
ed with ssRNA. Apoptosis is an end result of the induc- 
tion of the RNA stress response pathway. 

2'5'OAS activation 

[0171] The activation of 2'5'OAS was determined by 
performing ribosomal RNA fragmentation analysis. 
Briefly, following transfection, total RNA was extracted 
from cells using standard procedures. RNA was extract- 
ed at the following time points: 2 hours, 7 hours, 17 
hours, 48 hours, 3 days, 4 days, and 5 days after trans- 
fection. 5-10 fig RNA was analyzed for each sample. 
RNA samples were first denatured in formaldehyde/for- 
mamide RNA sample buffer at 65°C for 1 0 minutes prior 
to being electrophoresed through 0.5X TBE agarose 
gels. Ribosomal RNA was visualized by staining with 
ethidium bromide followed by ultraviolet transillumina- 
tion. Ribosomal RNA fragmentation was observed in 
cells transfected with poly (l)(C) and with the in vitro 
transcribed dsRNA. No fragmentation was observed in 
the untreated control cells, cells transfected with ssR- 
NA, or in cells expressing intracellular dsRNA. These 
results indicate that 2'5'OAS was not activated by dsR- 
NA when it was made intracellular^.. 2'5'OAS activation 
is associated with induction /activation of the RNA stress 
response pathway. 

PKR activation 

[0172] The activation of PKR was determined by 
measuring the levels of elF2alpha phosphorylation. 
Briefly, cells were lysed at various times after transfec- 
tion (2 hours, 7 hours, 19 hours, 48 hours, 3 days, 4 
days, and 5 days after transfection) and analyzed for 
levels of phosphorylated and non-phosphorylated elF2 
alpha. The protocol for lysing cells can be found in the 
following reference: Zhang F. era/., J. Biol. Chem. 276 
(27):24946-58, 2001 . This analysis was performed as 
described for detecting PKR phosphorylation except 
that antibodies specific for phosphorylated and non- 
phosphorylated elF2alpha were used. These antibodies 
are available from Cell Signaling Technology ( Beverly, 
MA). 

Cytopathic effect and antiproliferative responses 

[0173] Cytopathic effect was assayed by analyzing 
cells microscopically using a light microscope. Cells 
were analyzed at daily intervals throughout the course 
of the experiment. Cytopathic effect is defined as any or 
all of the following: cells detaching from surface of well/ 
flask, cells rounding up, an increased number of vacu- 
oles in transfected cells with respect to the control un- 
treated cells, or differences in morphology of cells with 


respect to the untreated control cells. No cytopathic ef- 
fect was seen in those cells expressing dsRNA intrac- 
ellularly. Severe cytopathic effects were seen in cells 
transfected with Poly (l)(C) or with dsRNA made in vitro. 
5 Cytopathic effect is associated with the RNA stress re- 
sponse. 

[0174] Antiproliferative responses were assayed by 
measuring the division rate of cells. The division rate is ' 
determined by counting cell numbers using standard 
10 procedures. Cells were counted every few days for the 
duration of the experiment. No antiproliferative respons- 
es were seen in cells expressing dsRNA intracellulary. 
Antiproliferative responses are associated with the RNA 
stress response. 

15 

Summary of results 

[0175] The results of the above assays indicate that 
intracellular expression of dsRNA does not induce the 

20 RNA stress response. The cells that were used for these 
experiments were competent for RNA stress response 
induction as was demonstrated by the ability of cationic 
lipid complexed poly(l)(C) and in vitro transcribed RNA 
to induce/activate all tested components of this re- 

25 sponse. In addition, the cells were found to be respon- 
sive to exogenously added interferon. These results im- 
ply that the cells used for these experiments are not de- 
fective in their ability to mount an RNA stress response 
and. therefore can be used as predictors for other cells, 

30 both in cell culture and in vivo in animal models. This 
method described here, which does not induce the in- 
terferon stress response, has been found to induce 
PTGS. This method therefore provides a method to in- 
duce PTGS without inducing an undesired RNA stress 

35 response. 

[0176] Although these results were generated using 
a vector that utilizes a T7 transcription system and there- 
fore expresses dsRNA in the cytoplasm, the vector sys- 
tem can be changed to other systems that express dsR- 

40 NA intracellular^. Similar results' are expected with 
these expression systems. These systems include, but 
are not limited to, systems that express dsRNA or hair- 
pin RNAs in the nucleus, in the nucleus followed by 
transport of the RNAs to the cytoplasm, or in the cyto- 

45 plasm using non-T7 RNA polymerase based expression 
systems. 

Example 12: Optimization of the concentrations and 
relative ratios of in vitro or in vivo produced dsRNA and 
50 delivery agent 

[0177] The optimal concentrations and ratios of dsR- 
NA to a delivery agent such as a cationic lipid, cationic 
surfactant, or local anesthetic can be readily determined 
55 to achieve low toxicity and to efficiently induce gene si- 
lencing using in vitro or in vivo produced dsRNA. 
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Summary of factors effecting nucleic acidl cationic lipid 
interactions 

[0178] Cationic lipid DNA interactions are electrostat- 
ic. Electrostatic interactions are highly influenced by the 5 
ionic components of the medium. The ability to form sta- 
ble complexes is also dependent upon the intermolecu- 
lar interactions between the lipid molecules. At low con- 
centrations, certain inter-lipid interactions are preferred; 
at higher lipid concentrations, rapid condensates are 10 
formed due to higher order interactions. Although local 
interactions are similar in both of these instances (e.g., 
phosphoryl groups in the DNA and the charged cationic 
head groupphe long range and inter-lipid interactions 
are substantially different. Similarly, structurally diverse 15 
variants can be obtained simply by changing the charge 
ratio of the complex by mixing varying amounts of cati- 
onic lipid with fixed concentrations of the nucleic acid or 
vice versa. This variation in the structure of the complex- 
es is evidenced by altered physical properties of the 20 
complexes (e.g., differences in octanol partitioning, mo- 
bility on density gradients, charge density of the particle, 
particle size, and transferability of cells in culture and 
in vivo) (Pachuk etal. DNA Vaccines - Challenges in De- 
livery, Current Opinion in Molecular Therapeutics, 2(2) 25 
188-198, 2000 and Pachuk et a/., BBA, 1468, 20-30, 
(2000)). Furthermore, different lipids, local anesthetics, 
and surfactants differ in their interactions between them- 
selves, and therefore novel complexes can be formed 
with differing biophysical properties by using different li- 30 
pids singularly or in combination. For each cell type, the 
following titration can be carried out to determine the 
optimal ratio and concentrations that result in complex- 
es that do not induce the stress response or interferon 
response. At several of these concentrations PTGS is 35 
predicted to be induced; however, PTGS is most readily 
observed under conditions that result in highly dimin- . 
ished cytotoxicity. 

Complex formation ao 

[01 79] dsRNA is either produced by in vitro transcrip- 
tion using the T7 promoter and polymerase or another 
RNA polymerase, such as an E. coli RNA polymerase. 
dsRNA can also be produced in an organism or cell us- *5 
ing endogenous polymerases. 
[0180] Concentrations of dsRNA, such as PSA-spe- 
cific dsRNA, are varied from 1 pg to 10 \lq. In some in- 
stances, 1 50 ng of a plasmid that encodes a reporter of 
interest (PSA) to be silenced may be comixed at a con- so 
centration between 1 0 ng and 1 0 |ig. The concentration 
of cationic lipid, cationic surfactant, local anesthetic, or 
any other transfection facilitating agent that interacts 
with the nucleic acid electrostatically are varied at each 
of the dsRNA concentrations to yield charge ratios of 55 
0.1 to 1000 (positive /negative) (i.e., the ratio of positive 
charge from lipids or other delivery agents to negative 
charge from DNA or RNA). The complexes are prepared 


in water or in buffer (e.g., phosphate, HEPES, citrate, 
Tris-HCI, Tris-glycine, malate, etc. at pH values that 
range from 4.0 to 8.5), may contain salt (e.g., 1 - 250 
mM), and may contain glycerol, sucrose, trehalose, xy- 
lose, or other sugars (e.g., mono-, di-, or polysaccha- 
ride). The mixture is allowed to sit at room temperature, 
desirably for 30 minutes, and may be stored indefinitely. 
The complexes are premixed in serum free media. The 
nucleic acid and the transfecting reagent may be mixed 
either through direct addition or through a slow mixing 
process, such as across a dialyzing membrane or 
through the use of a microporous particle or a device 
that brings the two solutions together at a slow rate and 
at low concentrations. In some instances, the two inter- 
acting components are mixed at low concentrations, 
and the final complex is concentrated using a diaf itera- 
tion or any other concentrating device. Alternatively, if 
the complexes are formed at high concentrations of ei- 
ther or both of the interacting components, the complex- 
es may be diluted to form an ideal transfection mixture. 

Transfection protocol and analysis of dsRNA stress 
response 

[0181] Complexes are added to cells that are -60 
-80% confluent in serum free media. The complexes are 
incubated for various times (e.g., 10 minutes to 24 
hours) with the cells at 37°C and diluted with serum con- 
taining media or washed and replated in serum free me- 
dia. The cells are monitored for toxicity and analyzed at 
various times for signs of dsRNA response (e.g., TUN- 
NEL assay to detect nicked DNA, phosphorylation of 
EIF2alpah, induction and activation of 2*5' OAS, or in- 
terferons Ipha and -beta). Transfection conditions that 
result in less than 50%, 25%, 10%, or 1 % cytotoxicity or 
that result in a less than 20, 1 0, 5, 2, or 1 .5-fold induction 
of a stress response are analyzed to determine if PTGS 
was efficiently induced. 

Determining induction of PTGS 

[0182] PSA protein levels are determined in cell cul- 
ture media using standard methods. The data is normal- 
ized to the number of live cells in culture to determine 
the concentrations required to induce PTGS. 

Results 

[0183] Using the above method, cationic lipid com- 
plexes of dsRNA induced toxicity at certain ranges. With 
lipofectamine as the cationic lipid, positive to negative 
charge ratios greater than 10 did not produce any de- 
tectable toxicity at any of the concentrations of dsRNA 
tested and induced a high level of PTGS, resulting in 
highly decreased levels of PSA in the culture medium. 
The RNA concentration ranges tested were 1 pg to 1 00 
ng with a constant amount of lipofectamine (10 uL of a 
2 mg/mL solution from GIBCO-BRL Life Technologies, 
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Bethesda, MD). 

Example 13: Method to avoid dsRNA mediated 
activation of the RNA stress response pathway 

5 

[0184] One or more components of the RNA stress 
response pathway can be mutated or inactivated to 
avoid induction/activation of the component(s) by dsR- 
NA that is delivered to the cell or animal for the purpose 
of inducing PTGS. These components, such as those 10 
illustrated in Fig. 4, can be knocked out singularly or in 
combination. 

[0185] Various standard methods can be used to 
knockout components of the RNA stress response path- 
way, such as PKR, human beta interferon Accession No. 15 
M25460), and/or 2'5'OAS (Accession No. NM.003733). 
Alternatively or additionally, one or more interferon re- 
sponse element (IRE) sequences can be mutated or de- 
leted using a knockout construct designed based on the 
IRE consensus sequence (Ghislain, etal. t J Interferon 20 
Cytokine Res. 2001 Jun 21(6):379-88.), and/or one or 
more transcription factors that bind IRE sequences, 
such as STAT1 (Accession number XM_010893), can 
be mutated or deleted. These methods include the use 
of antisense DNA/RNA, ribozymes, or targeted gene. 25 
knockout technology mediated by homologous recom- 
bination. One skilled in the art is able to design the ap- 
propriate antisense sequences, ribozymes, and vectors 
for targeted knockouts. For example, targeted knock- 
outs may be prepared by any of the following standard 30 
methods: Shibata et a/., Proc Natl Acad Sci USA. 2001 
Jul 17;98(15):8425-32. Review., Muyrers et a/., Trends 
Biochem Sci. 2001 May;26(5):325-31., Paul era/., Mutat 
Res. 2001 Jun5;486(1):11-9.,Shcherbakova etal., Mu- 
tat Res. 2000 Feb 1 6;459(1 ):65-71 ., Lantsov. Ideal gene 35 
therapy: approaches and prospects Mol Biol (Mosk). 
1994 May-Jun;28(3):485-95., in Gene Transfer and Ex- 
pression - A Laboratory Manual editor: Michael Kriegler, 
Publisher- WH Freeman & Co, New York, NY, pages 56 
-60,1990). ^ 40 

[0186] Knockout cells can be readily identified either 
through the use of an antibiotic resistance marker which 
when transferred to the chromosome confers resistance 
to the cell or through the use of dsRNA itself. In partic- 
ular, dsRNA (e.g., a high concentration of dsRNA) in- 45 
duces apoptosis in wild-type cells while mutant cells sur- 
vive dsRNA treatment because they cannot mount a 
stress response. Yet another approach involves per- 
forming the dsRNA-induced PTGS experiment in the 
presence of large concentrations of IRE (dsDNA) oligo, so 
which is expected to titrate activated STAT proteins. 
These oligos can be delivered intracerllularly using 
transfecting agents or electroporation. 
[01 87] In another method of preventing the interferon 
response, cells (e.g., RD cells) are transfected with a T7 55 
RNA polymerase expression vector and a T7 dsRNA ex- 
pression vector encoding dsRNA homologous to the hu- 
man protein kinase PKR cDNA (accession number 


M35663) or homologous to the coding sequence of any 
other component in the RNA stress response pathway. 
In one particular example, dsRNA corresponding to nu- 
cleotides 190 -2000 is encoded by the T7dsRNA ex- 
pression vector. The expression vectors are similar to 
those described in Example 10 and shown in Fig. 2, ex- 
cept that the dsRNA encoding sequence is derived from 
the human protein kinase PKR cDNA. Transfection in 
RD cells is performed as described in Example 1 0. With- 
in 2-5 days post-transfection, the cells are functionally 
PKR negative. 

Other Embodiments . . 

[0188] From the foregoing description, it will be appar- 
ent that variations and modifications may be made to 
the invention described herein to adopt it to various us- 
ages and conditions. Such embodiments are also within 
the scope of the following claims. 
[0189] All publication, patents, and patent applica- 
tions mentioned in this specification are herein incorpo- 
rated by reference to the same extent as if each individ- 
ual publication, patent, or patent application was specif- 
ically and individually indicated to be incorporated by 
reference. 


Claims 

1. A method for identifying a nucleic acid that modu- 
lates the function of a vertebrate cell, said method 
comprising the steps of : 

(a) transforming a population of vertebrate cells 
with a double stranded RNA expression library, 
wherein at least two cells of said population of 
cells are each transformed with a different nu- 
cleic acid from said double stranded RNA ex- 
pression library, wherein said transformed nu- 
cleic acid is capable of forming double stranded 
RNA, and wherein said transformation and said 
formation of double stranded RNA are carried 
out under conditions that inhibit or prevent an 
interferon response or a double stranded RNA 
stress response; 

(b) selecting a vertebrate cell in which said nu- 
cleic acid is expressed in said ceil; and 

(c) assaying for a modulation in the function of 
said cell, wherein said modulation identifies a 
nucleic acid that modulates the function of said 
vertebrate cell. 

2. A method for identifying a nucleic acid that modu- 
lates expression of a target nucleic acid in a verte- 
brate cell, said method comprising the steps of: 
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(a) transforming a population of vertebrate cells 
with a double stranded RNA expression library, 
wherein at least two cells of said population of 
cells are each transformed with a different nu- 
cleic acid from said double stranded RNA ex- 
pression library, wherein said transformed nu- 
cleic acid is capable of forming double stranded 
RNA, and wherein said transformation and said 
formation of double stranded RNA are carried 
out under conditions that inhibit or prevent an 
interferon response or a double stranded RNA 
stress response; 

(b) selecting a vertebrate cell in which said nu- 
cleic acid is expressed in said cell; and 

(c) assaying for a modulation in the expression 
of a target nucleic acid in said cell, wherein said 
modulation identifies a nucleic acid that modu- 
lates expression of said target nucleic acid. 

3. A method for identifying a nucleic acid that modu- 
lates the biological activity of a target polypeptide 
in a vertebrate cell, said method comprising the 
steps of: 

(a) transforming a population of vertebrate cells 
with a double stranded RNA expression library, 
wherein at least two cells of said population of 
cells are each transformed with a different nu- 
cleic acid from said double stranded RNA ex- 
pression library, wherein said transformed nu- 
cleic acid is capable of forming double stranded 
RNA, and wherein said transformation and said 
formation of double stranded RNA are carried 
out under conditions that inhibit or prevent an 
interferon response or a double stranded RNA 
stress response; 

(b) selecting for a vertebrate cell in which said 
nucleic acid is expressed in said cell; and 

(c) assaying for a modulation in the biological 
activity of a target polypeptide in said cell, 
wherein said modulation identifies a nucleic ac- 
id that modulates the biological activity of said 
target polypeptide. 

4. The method of claim 1 , wherein at most one nucleic 
acid is stably integrated into a chromosome of each 
ceil. 

5. The method of claim 1 , said method further com- 
prising: 

(d) identifying said nucleic acid by amplifying 
said nucleic acid and sequencing said amplified 
nucleic acid. 


6. The method of claim 1 , wherein said double strand- 
ed RNA expression library comprises cDNAs de- 
rived from said cells. 

5 7. The method of claim 1 , wherein said double strand- 
ed RNA expression library comprises randomized 
nucleic acids. 

8. The method of claim 1 , wherein said double strand- 
10 ed RNA expression library is a nuclear double 

stranded RNA expression library. 

9. The method of claim 1 , wherein said double strand- 
ed RNA expression library is a cytoplasmic double 

15 stranded RNA expression library. 

10. The method of claim 1 , wherein said cell is a mam- 
malian cell. 

20 11. The method of claim 10, wherein said cell Is a hu- 
man cell. 

12. The method of claim 10, wherein said cell is select- 
ed from the group consisting of a cancer cell, a cell 

25 of the immune system, a stem cell, a neuronal cell, 
a muscle cell, and an adipocyte. 

13. The method of claim 1 , wherein said nucleic acid is 
contained in a vector. 

30 

14. The method of claim 13, wherein said vector com- 
prises an RNA polymerase II promoter, an RNA 
polymerase I promoter, an RNA polymerase ill pro- 
moter, or a mitochondrial promoter. 

35 

15. The method of claim 13, wherein the sense strand 
and the anti-sense strand of said double stranded 
RNA are transcribed from the same nucleic acid us- 
ing two convergent promoters. 

40 

16. The method of claim 13, wherein said nucleic acid 
comprises an inverted repeat such that upon tran- 
scription said nucleic acid forms a double stranded 
RNA. 

45 

17. The method of claim 1 , wherein said assaying com- 
prises measuring an event selected from the group 
consisting of cell motility, apoptosis, cell growth, cell 
invasion, vascularization, cell cycle events, cell dif- 

so ferentiation, cell dedifferentiation, neuronal cell re- 
generation, and the ability of a cell to support viral 
replication. 

18. The method of claim 1 , wherein said double strand- 
55 ed RNA is between 5 and 1 00 nucleotides in length, 

inclusive. 

19. The method of claim 1 , wherein said double strand- 
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ed RNA is at least 100 nucleotides in length. 

20. The method of claim 19, wherein said double 
stranded RNA is at least 250 nucleotides in length. 

5 

21. The method of claim 20, wherein said double 
stranded RNA is at least 500 nucleotides in length. 

22. The method of claim 21, wherein said double 
stranded RNA is at least 1 000 nucleotides in length. 10 

23. A method for identifying a nucleic acid that modu- 
lates the function of a vertebrate cell, said method 
comprising the steps of: 

15 

(a) transforming a population of vertebrate cells 
with a double stranded RNA under conditions 
that inhibit or prevent an interferon response or 
a double stranded. RNA stress response; 

20 

(b) selecting a vertebrate cell in which said dou- 
ble stranded RNA is expressed; and 

(c) assaying for a modulation in the function of 
said cell, wherein said modulation identifies a 25 
nucleic acid that modulates the function of said 
cell. 

24. A method for identifying a nucleic acid that modu- 
lates expression of a target nucleic acid in a verte- 30 
brate cell, said method comprising the steps of: 

(a) transforming a population of vertebrate cells 
with a double stranded RNA under conditions 
that inhibit or prevent an interferon response or 35 
a double stranded RNA stress response; 

(b) selecting a vertebrate cell in which said dou- 
ble stranded RNA is expressed; and 

40 

(c) assaying for a modulation in the expression 
of a target nucleic acid in said cell, wherein said 
modulation identifies a nucleic acid that modu- 
lates expression of said target nucleic acid. 

45 

25. A method for identifying a nucleic acid that modu- 
lates the biological activity of a target polypeptide 
in a vertebrate cell, said method comprising the 
steps of: 

50 

(a) transforming a population of vertebrate cells 
with a double stranded RNA under conditions 
that inhibit or prevent an interferon response or 
a double stranded RNA stress response; 

55 

(b) selecting a vertebrate cell in which said dou- 
ble stranded RNA is expressed; and 


(c) assaying for a modulation in the biological 
activity of a target polypeptide in said cell, 
wherein said modulation identifies a nucleic ac- 
id that modulates the biological activity of said 
target polypeptide. 

26. The method of claim 23, said method further com- 
prising: 

(d) identifying said nucleic acid by amplifying 
said nucleic acid and sequencing said amplified 
nucleic acid. 

27. The method of claim 23, wherein said double 
stranded RNA is derived from said cells. 

28. The method of claim 23, wherein said double 
stranded RNA is between 5 and 100 nucleotides in 
length, inclusive. 

29. The method of claim 23, wherein said double 
stranded RNA is at least 100 nucleotides in length. 

30. The method of claim 29, wherein said double 
stranded RNA is at least 250 nucleotides in length. 

31. The method of claim 30, wherein said double 
stranded RNA is at least 500 nucleotides in length. 

32. The method of claim 31, wherein said double 
stranded RNA is at least 1 000 nucleotides in length. 

33. The method of claim 23, wherein each cell is trans- 
formed with at most one double stranded RNA. 

34. The method of claim 23, wherein said cell is a mam- 
malian cell. 

35. The method of claim 34, wherein said cell is a hu- 
man cell. 

36. The method of claim 23, wherein said population is 
transformed with at least 100 different double 
stranded RNAs. 

37. The method of claim 1 or 23, said method further 
comprising: 

(d) identifying said nucleic acid that modulates 
the function of said cell. 

38. The method of claim 2 or 24, said method further 
comprising: 

(d) identifying said nucleic acid that modulates 
expression of said target nucleic acid. 

39. The method of claim 3 or 25, said method further 
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comprising: 

(d) identifying said nucleic acid that modulates 
the biological activity of said target polypeptide. 
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